Article 


N'-methylpseudouridylation of mRNA 
causes +1 ribosomal frameshifting 


https://doi.org/10.1038/s41586-023-06800-3 


Received: 25 January 2023 


Accepted: 31 October 2023 


Published online: 06 December 2023 


Open access 


Thomas E. Mulroney’, Tuija Pòyry', Juan Carlos Yam-Puc', Maria Rust’, Robert F. Harvey’, 
Lajos Kalmar’, Emily Horner’, Lucy Booth’, Alexander P. Ferreira’, Mark Stoneley', 
Ritwick Sawarkar', Alexander J. Mentzer’, Kathryn S. Lilley?, C. Mark Smales*^, 

Tobias von der Haar’, Lance Turtle®, Susanna Dunachie"*?, Paul Klenerman"^, 

James E. D. Thaventhiran"""" & Anne E. Willis'"™ 


® Check for updates 


In vitro-transcribed (IVT) mRNAs are modalities that can combat human disease, 


exemplified by their use as vaccines for severe acute respiratory syndrome coronavirus 
2 (SARS-CoV-2). IVT mRNAsare transfected into target cells, where they are translated 
into recombinant protein, and the biological activity or immunogenicity of the 
encoded protein exerts an intended therapeutic effect’”. Modified ribonucleotides 

are commonly incorporated into therapeutic IVT mRNAs to decrease their innate 
immunogenicity’, but their effects on mRNA translation fidelity have not been fully 
explored. Here we demonstrate that incorporation of M-methylpseudouridine into 
mRNA results in +1 ribosomal frameshifting in vitro and that cellular immunity in mice 
and humans to *1frameshifted products from BNT162b2 vaccine mRNA translation 
occurs after vaccination. The +1 ribosome frameshifting observed is probably a 
consequence of M-methylpseudouridine-induced ribosome stalling during IVT mRNA 
translation, with frameshifting occurring at ribosome slippery sequences. However, 
we demonstrate that synonymous targeting of such slippery sequences provides an 
effective strategy to reducethe production of frameshifted products. Overall, these 
data increase our understanding of how modified ribonucleotides affect the fidelity 

of mRNA translation, and although there are no adverse outcomes reported from 
mistranslation of mRNA-based SARS-CoV-2 vaccines in humans, these data highlight 
potential off-target effects for future mRNA-based therapeutics and demonstrate the 
requirement for sequence optimization. 


A key feature of therapeutic IVT mRNAs is that they contain modi- 
fied ribonucleotides, which have been shown to decrease innate 
immunogenicity and can additionally increase mRNA stability, both 
of which are favourable characteristics for mRNA therapies! ©. For 
example, clinically approved SARS-CoV-2 mRNA vaccines incorpo- 
rate N-methylpseudouridine (1-methylY), which has been shown to 
decrease IVT mRNA innate immunogenicity’ *. Some modified ribo- 
nucleotides, such as 5-methylcytidine (5-methylC), are naturally occur- 
ring post-transcriptional mRNA modifications in eukaryotes, whereas 
others are not, such as 1-methyl (refs. 6-10). 

We investigated how 5-methoxyuridine (5-methoxyU), 5-methylC 
and 1-methyl affect translation of IVT mRNA. 5-methoxyU, 5-methyIC 
and 1-methyl have been utilized in IVT mRNAs to attempt to increase 
recombinant protein synthesis in vitro, and for preclinical proof of 
concept for IVT mRNA-basedtherapies!!?. As mentioned, 1-methylU is 
aribonucleotide incorporated in licensed IVT mRNA-based SARS-CoV-2 


vaccines, but also mRNA-based human vaccines and therapies in 
development’ ”*., 

Despite their widespread use, surprisingly little is known about how 
ribonucleotide modification affects protein synthesis, particularly 
for translation of therapeutic IVT mRNAs. We were interested in how 
modified ribonucleotides affect the fidelity of mRNA translation for 
several reasons. Certain ribonucleotide modifications can recode mRNA 
sequences (for example, inosine”). 5-methylC has previously been shown 
toincrease misreading during mRNA translation in prokaryotes, but its 
effect on eukaryotic mRNA translation fidelity has not been explored”. 
The effect of 5-methoxyU on translation fidelity has not been investi- 
gated. Pseudouridine (W) is known to increase misreading of mRNA stop 
codons in eukaryotes, and can affect misreading during prokaryotic 
mRNA translation!'* 5, ]-methylU does not seemto affect codon misread- 
ing, but has been shown to affect protein synthesis rates and ribosome 
density on mRNAs, suggesting a direct effect on mRNA translation 9, 
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At present, itis unclear which modified ribonucleotides affect mRNA 
translation fidelity and existing studies are mostly limited to under- 
standing misreading frequencies only at a given codon. Misreading of 
mRNA codonsis also only one type of post-transcriptional mechanism 
that can alter a polypeptide sequence. So far, no study has investigated 
the fundamental question of whether modified ribonucleotides can 
affect the maintenance of the correct reading frame during translation 
of asynthetic transcript. Understanding these processes is critical to 
increase our knowledge of protein synthesis from modified mRNAs in 
general, but is also imperative for the robust design and evaluation of 
new mRNA-based therapeutics that make use of modified ribonucleo- 
tides within widely differing RNA sequences or therapeutic contexts. 

To investigate how ribonucleotide modification affects reading frame 
maintenance during translation of mRNA, we designed and synthesized 
IVT mRNAs (Fluc+1FS) that report on out-of-frame protein synthesis 
(Fig. la). Fluc-IFS mRNAs encode an amino-terminal segment of firefly 
luciferase (NFluc) and a complementary carboxy-terminal segment of 
Fluc (CFluc), directly downstream. CFluc is encoded in the +1 reading 
frame. Fluc+1FS mRNAs are designed to produce catalytically inactive 
(truncated) NFluc when translated normally. However, if ribosomes 
move out of frame during translation, elongated polypeptides contain- 
ing residues from both in-frame NFluc and out-of-frame CFluc can be 
produced, which can increase catalytic activity. 

We synthesized unmodified Fluc+1FS mRNAs, which contain canoni- 
cal ribonucleotides, and translated them in vitro. We confirmed that 
Fluc+1FS mRNAs produce catalytically inactive NFluc (Extended Data 
Fig. 1). By comparison, unmodified wild-type (WT) Fluc mRNA, con- 
taining the complete in-frame Fluc coding sequence, produced the 
expected active protein (Extended DataFig. 1). Then we synthesized and 
translated each mRNA containing 5-methoxyU, 5-methylC, 1-methylW, 
5-methoxyU + 5-methylC or 1-methylW + 5-methylC. Translation of 
WT Fluc mRNA was not significantly affected by either 1-methyl or 
5-methylC modifications alone, but was decreased by incorporating 
both ribonucleotides into a single transcript (Fig. 1b). 5-methoxyU incor- 
poration alone, or combined with 5-methylC, significantly decreased 
translation of WT Fluc mRNA (Fig. 1b). Incorporation of 1-methylW 
in Fluc+1FS mRNA significantly increased ribosomal +1 frameshift- 
ing to about 8% of the corresponding in-frame protein, which was not 
observed for other ribonucleotides (Fig. 1c). HeLa cells transfected 
with 1-methylW Fluc+1FS mRNA recapitulated the results from in vitro 
translation (Fig. 1d). On the basis of these observations, we concluded 
that IVT mRNA containing 1-methylY or 5-methylC exhibits similar 
translation efficiency to unmodified mRNA, but 1-methyIU significantly 
increases ribosomal +1 frameshifting during mRNA translation. 

We observed a large increase in ribosomal +1 frameshifting during 
translation of 1-methylY mRNA and reasoned that gaining better 
understanding of the translation products would complement the 
reporter assay data and help to explain how +1 frameshifted products 
originate. To address these aspects, we probed the polypeptides pro- 
duced during IVT mRNA translation by western blotting. Translation of 
unmodified Fluc+1FS mRNA produced the expected in-frame truncated 
product, which was also true for 5-methylC mRNA (Fig. 1e). Translation 
of 1-methylW mRNA produced the expected in-frame product, but also 
produced two additional bands at higher molecular weight (Fig. le). 
We reasoned that these products were +1 frameshifted polypeptides. 
We also confirmed that 1-methylW + 5-methylC-, 5-methoxyU- and 
5-methoxyU + 5-methylC mRNAs were comparatively poor mRNA tem- 
plates for protein synthesis (Fig. 1e). 

1-methyIW is also used in clinically approved SARS-CoV-2 mRNA 
vaccines?“. As 1-methyl increased +1 ribosome frameshifting during 
translation in vitro, we investigated whether this occurs in vivo for 
BNT162b2, a SARS-CoV-2 mRNA vaccine containing 1-methylW. We 
reasoned that +1 ribosomal frameshifting during recombinant anti- 
gen mRNA translation could lead to presentation of +1 frameshifted 
products to T cells, and elicit off-target cellular immune responses 
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Fig.1|Translation of I-methyIU-modified mRNA produces +1 frameshifted 
polypeptides. a, Structures of IVT mRNA transcripts used to probe protein 
synthesis fidelity. WT Fluc contains only (in-frame) Fluc coding sequence. 

For Fluc+1FS, the green segment represents in-frame N-terminal Fluc coding 
sequence (NFluc), and the orange segment represents +1 frameshifted 
C-terminal Fluc coding sequence (CFluc). Asterisk represents a premature 
stop codon. b, Luciferase activity produced by translation of WT Fluc mRNAs, 
either unmodified control (canonical nucleotides), or containing 1-methylU 
(m1U), 5-methylC (m5C), 5-methoxyU (mo5U) or the combinations indicated. 
**P < 0.01 (I-methylU + 5-methylC, P= 0.0051; 5-methoxyU, P= 0.0023; 
5-methoxyU + 5-methylC, P= 0.0042; one-way analysis of variance (ANOVA) 
with Dunnett's test). c, Luciferase activity produced by translation of modified 
Fluc+1FS mRNAs and unmodified control. 1-methylW, P = 0.002 (one-way 
ANOVA with Dunnett's test). d, Luciferase activity inlysates produced by 
transfection of HeLa cells with unmodified or 1-methylY Fluc+1FS mRNA for 
8h.P- 0.0104 (Welch's one-tailed t-test). e, Western blot analysis (anti-Flag 
epitope) of polypeptides produced by translation of mRNAs inc. All data are 
obtained from n =3 replicated experiments. e shows a single blot fromn =3 
replicated experiments. Asterisks represent bands at higher molecular weight. 
For gel source data, see Supplementary Fig. 2. 


(Fig. 2a). Antigen presentation from mistranslation of endogenous 
tumour mRNA has been shown to occur in vivo (for example, ref. 21). To 
address this possibility, we vaccinated mice with BNT162b2 and quanti- 
fied their T cell response to in-frame SARS-CoV-2 spike protein and +1 
frameshifted products predicted to occur by translation ofthe mRNA +1 
frame, as well as an unrelated control antigen (SARS-CoV-2 M protein), 
by interferon-y (IFNy) ELISpot assay. Junction peptides consisting of 
in-frame N-terminal residues and C-terminal +1 frameshifted residues 
were not included. We found that responses to +1 frameshifted spike 
peptides were significantly increased in vaccinated mice compared to 
untreated mice or those vaccinated with ChAdOx nCoV-19, which does 
not produce antigen from translation of N'-methylpseudouridylated 
mRNA? (Fig. 2b). Both BNT162b2 and ChAdOx1 nCoV-19 vaccination 
produced ELISpot responses to in-frame SARS-CoV-2 spike (Fig. 2c). 
These data suggest that +1 frameshifted products encoded in BNT162b2 
spike mRNA are T cell antigens for inbred mice, to which off-target 
immunity can be detected following vaccination. 
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Fig. 2|+1frameshifted products elicit off-target cellular immune 
responses following modified mRNA vaccination. a, Depiction of spike and 
+1 frameshifted (+1FS) products produced by 1-methylW-modified spike mRNA 
translation. CDS, coding sequence. b, Splenocyte IFNy ELISpot responses 
from untreated, ChAdOx1 nCoV-19-vaccinated or BNT162b2-vaccinated mice 
stimulated with +1FS spike peptides. IFNy ELISpot response from BNT162b2- 
vaccinated mice stimulated with SARS-CoV-2 M peptides (unrelated control 
antigen) is included for additional comparison. SFU, spot-forming units. 

Each groupn =8. Untreated versus ChAdOx1 nCoV-19, P= 0.963; untreated 
versus BNT162b2, P= 0.0005; ChAdOx1 nCoV-19 versus BNT162b2, P= 0.001. 
c, Splenocyte IFNy ELISpot responses from mice in b stimulated with spike 
peptides. Untreated versus ChAdOx1 nCoV-19, P= 2.05 x 10 *; untreated versus 
BNT162b2, P= 4.5 x 10; ChAdOx1 nCoV-19 versus BNT162b2, P- 1.88 x 10. 


We then compared IFNy ELISpot responses to predicted +1 frame- 
shifted SARS-CoV-2 spike protein products in 21 individuals vaccinated 
with BNT162b2 and compared these responses to those of 20 individu- 
als vaccinated with ChAdOx1 nCoV-19, none of whom reported undue 
effects as a result of vaccination. We detected a significantly higher 
IFNy response to +1 frameshifted antigen in the BNT162b2 vaccine 
group, compared to ChAdOx1 nCoV-19 (Fig. 2d). There was no asso- 
ciation between T cell responses to +1 frameshifted antigen and age, 
sex or HLA subtype (Supplementary Table 1 and Extended Data Figs. 2 
and 3). Both ChAdOx1 nCoV-19 and BNT162b2 vaccination produced 
ELISpot responses to in-frame SARS-CoV-2 spike, but responses to 
+1frameshifted products were observed only in individuals vaccinated 
with BNT162b2 (Fig. 2e,f). During SARS-CoV-2 viral replication, a pro- 
grammed —1ribosomal frameshift occurs naturally during translation 
of open reading frame (ORF) 1a and ORF 1b (ref. 23). It is not feasible 
that these data are a consequence of natural SARS-CoV-2 infection 
for the following, non-exhaustive, reasons. First, no frameshifting 
activity is known to occur during SARS-CoV-2 spike subgenomic mRNA 
translation (which would bea major discovery in its own right). Second, 
-1 frameshifting (and not +1 frameshifting) is restricted to a single 
programmed site in ORFla and ORFIb (ref. 23). Third, +1 frameshifted 


d, Peripheral blood mononuclear cells (PBMC) IFNy ELISpot responses 

from donors vaccinated with ChAdOx1 nCoV-19 (n = 20) or BNT162b2 (n = 21) 
stimulated with +1FS spike peptides. P= 0.0233 (Welch’s one-tailed t-test). 

e, PBMC IFNy ELISpot responses from donors inc stimulated with in-frame 
spike peptides: total spike pool or spike S1 + S2 subpools. ChAdOx1 nCoV-19 
spike versus BNT162b2 spike, P= 0.371; ChAdOx1 nCoV-19 spike versus 
BNT162b2 S1 + S2, P= 0.0845; BNT162b2 spike versus BNT162b2 S1 + S2, 
P=0.686.f, Representative images of PBMC IFNy ELISpot response wells for 
two individuals vaccinated with either BNT162b2 responder (top) or ChAdOx1 
nCoV-19 (bottom). Left to right: in-frame spike response (spike peptides); +1FS 
spike response (+1FS spike peptides); no peptide control. Pvalues in b,c,e were 
determined by one-way ANOVA and Tukey's test. 


peptides are predicted from the BNT162b2 mRNA sequence, and notthe 
Sgenesequence from wild virus (Extended Data Fig. 4). Instead, these 
data suggest that vaccination with I-methylU mRNA can elicit cellular 
immunity to peptide antigens produced by +1 ribosomal frameshifting 
in both major histocompatibility complex (MHC)-diverse people and 
MHC-uniform mice. 

To provide further mechanistic insight into +1 ribosome frameshift- 
ing during translation of 1-methylW mRNA, and identify potential 
frameshift sites or sequences, we translated 1-methylU Fluc+1FS 
mRNA, purified the major putative +1 frameshifted polypeptide and 
carried out liquid chromatography tandem mass spectrometry (LC- 
MS/MS) of tryptic digests. From this single polypeptide, we identified 
six in-frame peptides and nine peptides derived from the mRNA +1 
frame (Fig. 3a and Extended Data Table 1). All in-frame peptides were 
mapped to the N-terminal region, whereas +1 frameshifted peptides 
were mapped downstream (Fig. 3a). We then repeated this analysis 
using a different protease and identified a junction peptide spanning 
the main frame and the +1 frame (Fig. 3b). These data demonstrated 
that the elongated polypeptide was indeed a chimeric polypeptide 
consisting of in-frame N-terminal residues and +1 frameshifted 
C-terminal residues. As expected, shorter frameshifted products 
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Fig. 3 | Mistranslation of 1-methylW mRNA is due to +1 ribosomal 
frameshifting and not transcriptional errors. a, Tryptic peptide coverage 
plot of the purified high molecular weight polypeptide produced by translation 
of 1-methylW Fluc+1FS mRNA, showing in-frame residues (top) and +1 
frameshifted residues (bottom). —log,)[PEP] is the mass spectrum percolator 
score (only high-quality peptides are shown). IP, immunoprecipitate. The 
structure of Fluc+1FS mRNA from Fig. lis re-displayed and a western blot of 


were also produced from translation of 1-methylY mRNA encoding 
full-length Fluc (Extended Data Fig. 5). 

Apparenterrors in protein synthesis, including frameshifting, can 
be consequences of DNA mutation or transcriptional errors?*. Hence, 
faithful translation of anincorrect mRNA sequence can produce incor- 
rect proteins. In vitro transcripts are presumed to be exact RNA copies 
oftemplate DNA, the accuracy of which may be estimated by the fidel- 
ity of the used RNA polymerase. However, the substitution of canonical 
substrate ribonucleoside triphosphates for modified nucleotides 
may increase transcriptional errors. To address this possibility, we 
carried out high-throughput RNA sequencing of unmodified and 
1-methylU Fluct1FS mRNA and quantified nucleotide insertions and 
deletions in each population of IVT mRNA. Nucleotide deletion pro- 
files for each mRNA were very similar (Fig. 3c), as were nucleotide 
insertions (Fig. 3d), suggesting few site-specific differences. The over- 
all frequency of insertions and deletions was low, and did not differ 
significantly between unmodified and 1-methylU mRNA (Extended 
Data Table 2), which is supported by recent observations”. From these 
findings, we concluded that frameshifted products of 1-methylW 
mRNA translation were not due to transcriptional errors, but were 
due to bona fide ribosomal +1 frameshifting—a post-transcriptional 
mechanism. 

Ribosome frameshifting is a well-documented phenomenon that 
occurs during translation of many naturally occurring mRNAs”. As 
ribosome stalling is implicated in several instances of +1 frameshift- 
ing, we queried how the presence of 1-methylIU in IVT mRNA affects 
translation elongation?”?". To do this, we assayed protein synthesis 
during translation of unmodified or 1-methylY WT Fluc mRNA using 
co-translational [^S]methionine labelling”. Translation elongation 
of 1-methylW mRNA was slower than for unmodified mRNA (Fig. 4a), 
which is supported by previous observations”. All reactions were 
run for 30 min and there was less full-length protein produced from 
the translation of 1-methylW-containing mRNAs, suggesting a slower 
elongation rate compared to that of unmodified mRNA, with a greater 
proportion of premature polypeptide products. These data suggested 
that elongating ribosomes stall during translation of mRNA contain- 
ing I-methylU. 
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the translation reaction before immunoprecipitation is displayed. For gel 
source data, see Supplementary Fig. 3. b, Junction peptide derived from 

+1 ribosomal frameshifting and the originating mRNA sequence. c, Nucleotide 
deletions in unmodified (top) and 1-methylW (bottom) Fluc+1FS mRNA, 
quantified by n = 3 RNA-sequencing analyses. d, Nucleotide insertions in 
unmodified (top) and 1-methyl (bottom) Fluc+1FS mRNA. 


It was unclear whether 1-methylU affected mRNA decoding rates, or 
another process, during elongation. We reasoned that slower decoding 
of 1-methylW codons during translation elongation could lead to ribo- 
some stalling, similar to previous observations for 'hungry' codons 
at sites of +1 frameshifting during translation of naturally occurring 
mRNA’, We probed the molecular mechanism of ribosome stalling 
during 1-methylU mRNA translation using the aminoglycoside paro- 
momycin. In brief, during mRNA decoding, cognate aminoacyl-tRNA 
anticodon-codon interaction causes local conformational changes in 
18S rRNA (in eukaryotes), after which a new peptide bond is formed, 
ribosome subunit rotation occurs, and subsequent ribosome con- 
formational changes, elongation factor 2 binding and translocation 
to the next codon completes the elongation cycle”. Paromomycin 
binds to helix 44 of 18S rRNA in elongating ribosomes and alters its 
conformation in the decoding centre, which inhibits translation 
but also permits the productive binding of near- and non-cognate 
aminoacyl-tRNAs to the 80S ribosome A-site”. In doing so, paromo- 
mycin increases the misincorporation of amino acids into elongating 
polypeptides”. We reasoned that if slow decoding during 1-methylW 
mRNA translation was due to altered aminoacyl-tRNA binding kine- 
tics, this process could be decreased by paromomycin. This is because 
paromomycin-bound ribosomes could incorporate additional near- or 
non-cognate aminoacyl-tRNAs and effectively increase the substrate 
aminoacyl-tRNA pool at ribosome stall sites. Translation of 1-methylW 
mRNA was slower than that of unmodified mRNA and the proportion of 
premature polypeptide products was greater (Fig. 4a). However, during 
1-methylW mRNA translation, polypeptide elongation was improved 
by the addition of paromomycin, whereas paromomycin was inhibi- 
tory only to unmodified mRNA translation (Fig. 4a). Together, these 
data showthat slow translation of 1-methylW mRNA is probably due to 
ribosome stalling, which is caused by altered aminoacyl-tRNA binding, 
and which can be rescued by increasing the incorporation of near- or 
non-cognate amino acids into elongating polypeptides. 

Although there is no evidence that frameshifted products in humans 
generated from BNT162b2 vaccination are associated with adverse 
outcomes, for future use of MRNA technology it is important that 
mRNA sequence design is modified to reduce ribosome frameshifting 
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Fig. 4|+Iribosomalframeshiftingisdependent on mRNA slippery 
sequences and associated with ribosome stalling during 1-methylW mRNA 
translation. a, SDS-polyacrylamide gel electrophoresis autoradiograph of [^S] 
methionine-labelled polypeptides produced by translation of unmodified or 
1-methyIW Fluc mRNA for 30 min, including or omitting 100 uM paromomycin 
(+PMN and -PMN, respectively). b, Diagram showing putative mRNA slippery 
sequences and stop-codon-flanked windows. c, Activity of *1frameshifted 
products after translation of 1-methylW mutant mRNAs, or 1-methylW Fluc+1FS2 


events, as this may limit its future use for applications that require 
higher doses or more frequent dosing, such as the in vivo production 
of hormones. It is important to continue investigating therapeutic 
mRNA mistranslation and immunogenicity, as the evolution of antibody 
and cytolytic T cell responses against +1 frameshifted spike variants 
and peptides has not been systematically evaluated in humans and 
ELISpot responses obtained from pooled peptides may also underes- 
timate T cell responses. The main in-frame mRNA-encoded product is 
unlikely to elicit an adaptive immune response, but presentation of +1 
frameshifted products could activate T cells that target host cells. We 
reasoned that if we were able to identify +1 ribosome frameshift sites or 
sequences it would be possible to alter the mRNA sequence to reduce 
such effects. As proof of principle, we used our reporter IVT mRNA sys- 
tem. LC-MS/MS analysis showed that translation of 1-methylU mRNA 
leads to synthesis of +1 frameshifted products within the area of coding 
sequence between detected in-frame residues and downstream +1 
frameshifted residues (Fig. 3a). We searched the RNA sequence cor- 
responding to this region in the junction peptide coding sequence 
(Fig. 3b) and determinants of ribosome frameshifting from published 
mechanisms, from which we identified three potential ribosome slip- 
pery sequences (Fig. 4c), with all three sequences having the potential 
to be decoded by the same aminoacyl-tRNA at an in-frame codon or in 
the immediate +1 frame codon. Notably, six slippery sites identical to 


control mRNA, for 2 h. Fluc+1FS2 versus U*187C, P= 0.024; Fluc+1FS2 versus 
U*208C, P= 0.0236 (one-way ANOVA with Dunnett's test). d, Total mRNA 
translation over 2h for each of Fluct1FS2 mRNA or mutant mRNAs, quantified 
by [5S]methionine incorporation. CPM, counts per minute. e, Western blot 
analysis (anti-Flag epitope) of polypeptides produced by translation of mRNAs 
inc, and U*187C/U*208C double-mutant 1-methylU mRNA. Dataare fromn=3 
replicated experiments. a and e show representative images fromn=3 
replicated experiments. For gel source data, see Supplementary Figs. 4 and 5. 


Fluc+1FS slippery sites B and C were also distributed in the BNT162b2 
spike mRNA coding sequence. These sites have been annotated in 
the Fluc+1FS coding sequence (Fig. 4b) and the BNT163b2 spike 
mRNA coding sequence (Extended Data Fig. 6). We reasoned that 
these sequences could therefore function as sites for +1 ribosomal 
frameshifting. We synonymously mutated each site in I-methylU 
Fluc+1FS mRNA such that the in-frame amino acid was unchanged, but 
the immediate +1 frame codon was mutated to a non-cognate amino 
acid, hence destroying the ribosome slippery sequence, and translated 
the mRNAs to evaluate the contribution of each site to +1 ribosomal 
frameshifting (Fig. 4c). A +1 frame stop codon was present downstream 
of slippery site A, and it was unlikely that frameshifting at this site con- 
tributed toincreased luciferase activity. As expected, luciferase activity 
produced by translation of site A mutant A206G mRNA was the same 
as control levels (Fig. 4c). However, both slippery site B mutant U*187C 
mRNA and slippery site C mutant U*208C mRNA strongly decreased +1 
ribosome frameshifting (Fig. 4c). Notably, translation efficiency of each 
mRNA was equal, which suggests that no mutation adversely affected 
mRNAtranslation overall, but solely +1 ribosomal frameshifting activity 
(Fig. 4d). Translation of a U*187C/U*208C double-mutant 1-methylP 
Fluc+1FS mRNA produced no detectable +1 ribosome frameshifting 
(Fig. 4e). The transframe protein product predicted by +1 frameshift- 
ing at slippery site C contains an alteration of 19 amino acid residues 
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(compared to WT Fluc), whereas +1 frameshifting at slippery site B 
produces a transframe product that is effectively 100% homologous 
to WT Fluc. In addition, given that mutation of either slippery site B 
or C (U*187C or U*208C) significantly decreased luciferase activity, 
but that relatively more frameshifted product was produced by trans- 
lation of U*208C mRNA (Fig. 4e), we reasoned that the transframe 
product produced by frameshifting at slippery site C had lower specific 
luciferase activity, and that frameshifting at slippery site B contrib- 
uted to most of the detected luciferase activity as a consequence of 
+1 ribosome frameshifting. Taken together, these data suggest that 
N'-methylpseudouridylation at defined mRNA sequences triggers 
ribosome +1 frameshifting; however, with appropriate mRNA sequence 
design, it is possible to ameliorate this issue. 


Conclusions 


We show that 1-methyl is a modified ribonucleotide that significantly 
increases +1 ribosomal frameshifting during mRNA translation and that 
cellular immunity to +1 frameshifted products can occur following vac- 
cination with mRNA containing 1-methylU. To our knowledge, this is the 
firstreport that mRNA modification affects ribosomal frameshifting. 
Alongside this impact on host T cell immunity, the off-target effects of 
ribosomal frameshifting could include increased production of new 
B cell antigens. Other ribonucleotide modification strategies, such 
as incorporation of 5-methoxyU, significantly decreased translation 
efficiency of IVT mRNAs, which may limit clinical translation. Although 
we have shown that translation of N'-methylpseudouridylated mRNA 
leads to +1 ribosomal frameshifting in vitro and in cultured cells, it is 
conceivable that other mistranslation events (such as leaky scanning) 
could also contribute to T cell responses to +1 frameshifted peptide 
antigens. We show that IVT mRNAs contain few nucleotide insertions 
and deletions, and this is not changed by 1-methylY incorporation. Our 
data show that +1 ribosomal frameshifting occurs at two character- 
ized slippery sequences. Therefore, we believe that the minor band of 
approximately 50 kDa produced by Fluc+1FS mRNA translation is prob- 
ably aconsequence of several frameshifting events (Fig. le). Translation 
of mRNA containing I-methylU leads to slower translation elongation, 
caused by altered aminoacyl-tRNA binding, which demonstrates why 
+1 ribosomal frameshifting does not occur during unmodified mRNA 
translation—both ribosome stalling and ribosome slippery sequences 
seem to be required for productive +1 ribosome frameshifting. Our 
mechanistic data are supported by previous observations of ribosomal 
frameshifting during translation of naturally occurring mRNAs, which 
implicate ribosome stalling and require ribosome slippery sequences 
for *1frameshifting??9?555?* These findings are of particular impor- 
tance to our fundamental understanding of how ribonucleotide modi- 
fication affects mRNA translation, and for designing and optimizing 
future mRNA-based therapeutics to avoid mistranslation events that 
may decrease efficacy or increase toxicity. 
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Methods 


Plasmids and mRNA synthesis 

Phusion High-Fidelity DNA polymerase reagents were obtained from 
New England Biolabs. In-frame WT Fluc template DNA was produced by 
Xbal digest of pUCK100Fluc, including an 80-nucleotide polyA tail”. 
Fluc+1FS template DNA was produced by overlap extension PCR of 
pUCK100Fluc using FlucFlag F (5’-TTATACCATGGGTGACTACAAAGA 
CCATGACGGTGAT TATAAAGATCATGACATCGAT TACAAGGATGACGA 
TGACAAGCTCGAAGACGCCAAAAACATAAAGAAAGG-3’), Fluc*1FS R 
(5’-GAT TGCCGAAAAATAGGATCTCTGGCATG-3’) for Fluc+1FS NFluc, 
Fluc+1FS F (5’-CAGAGATCCTATTTT TCGGCAATCAAATCAT-3’) and 
Fluc_R (5’-TAGATTGCTAGCT TATGT TAAT TACACGGCGATCTTTCCG-3’) 
for Fluc+1FS CFluc. PCR products were reinserted into pUCK100 
using Ncol and Nhel, and linear template DNA was produced by Xbal 
digest. A206G, U*187C and U*208C mRNA were transcribed from cus- 
tom genes subcloned into pUC57T7 (Genscript Biotech) and linear 
template DNA was produced by BamHI digest. Fluc+1FS2 mRNA was 
produced from Fluc+1FS template DNA subcloned into pUC57T7 and 
linearized by BamHI. U*187C/U*208C template DNA was produced 
by overlap extension PCR and reinsertion into pUC57. Reporter RNA 
sequences are shown in Supplementary Fig. 1. In vitro transcription 
was carried out using TranscriptAid T7 High Yield Transcription Kit 
(Thermo Scientific KO441). UTP and CTP were substituted where 
required for 5-methoxyUTP, N'-methylpseudoUTP or 5-methylCTP. 
Modified nucleotides were obtained from Trilink Biotechnologies. 
Transcripts were 5’-capped using the Vaccinia Capping System (NEB 
M2080S) and purified by phenol-chloroform extraction and GSO size 
exclusion. Transcripts were quantified using a Nanodrop ND2000 
spectrophotometer (Thermo Scientific) and stored at -80 °C. 


RNA gel electrophoresis 

Samples were heated in formamide with bromophenol blue and xylene 
cyanol dye for 3 min at 95 °C, cooled for 2 min on ice and resolved ona 
1% agarose formaldehyde MOPS acetate gel for 90 min at 90 V. The gel 
was stained in 0.5 pg ml ethidium bromide for 1h, bathed in distilled 
water for 1h and visualized by UV transillumination. 


Cell culture and mRNA transfection 

HeLa cells wereagift from the Proudfoot Lab, University of Oxford. Cells 
were authenticated by STR typing. Cells were grown in DMEM (Gibco 
41966029), supplemented with 10% FBS at 37 °C, 5% CO,. Cells were 
tested for mycoplasma contamination and tested negative. Approxi- 
mately 16 h before transfection, cells were seeded at 0.2 x 10° mI in 
6-well plates. Ten minutes beforetransfection, the medium was changed 
to OptiMEM (Gibco 31985062), after which cells were transfected with 
4 pmolFluc*1FS mRNA in Lipofectamine 2000 (Invitrogen 11668019). 
After 4 htransfection, OptiMEM was replaced with DMEM, and cells were 
cultured for a further 4 h, and then lysed in Passive Lysis Buffer (Promega 
E1941). Lysates were centrifuged (10,000g, 5 min) and luciferase acti- 
vity was determined in supernatants using the Luciferase Assay System 
(Promega E4550) and GloMax multi-well plate luminometer (Promega). 


Invitro translation 

IVT mRNAs were translated using the Flexi Rabbit Reticulocyte 
Lysate System using nuclease-treated RRL (Promega L4540). For 
co-translational labelling, 0.33 pi translation-grade [^S]methionine 
(Hartman Analytic KSM-01) and 0.67 pl amino acids minus methionine 
(Promega L996A) were used per 15 pi reaction. Unlabelled products were 
produced with 1 pl total (unlabelled) amino acids (Promega L4461). The 
concentration of IVT mRNA was 50 nMand, when included, the concen- 
tration of paromomycin (Sigma Aldrich P9297) was 100 pM. Creatine 
phosphate (Roche 10621714001), creatine kinase (Roche 21778721), 
potassium acetate (Sigma Aldrich P1190) and magnesium acetate (Sigma 
Aldrich M5661) were included at 10 mM, 25 pg mI, 50 mM and 0.5 mM, 


respectively”. Reactions were incubated at 30 °C for the indicated time 
and moved toice, after which 10 ul RNase A/T1 and Benzonase was added 
and incubated for 10 min. Luciferase activity was determined using the 
Luciferase Assay System (Promega E4550) and measured using a GloMax 
multi-well plate luminometer (Promega). The relative proportion of 
gain-of-function luciferase activity from Fluc+1FS mRNA translation was 
calculated by the relative light units (RLUs) for each mRNA translation 
reaction as a percentage of unmodified WT Fluc mRNA translation RLUS. 
For western blotting, 2x reducing LDS PAGE buffer was mixed with each 
sample, which was heated to 70 °C for 10 min. Cooled samples were 
resolved on NuPAGE 4-12% or 12%, Bis-Tris, 1.0 mm, Mini Protein Gels 
(Invitrogen). The resolved products were transferred to nitrocellulose 
membrane and probed using anti-Flag M2 antibody (Sigma Aldrich 
F1804) and anti-mouse-HRP antibody (Dako P0447), and detected with 
Clarity Western ECL substrate (Bio-Rad 1705060). Figures 1 and 4 show 
translation reactions from n = 3 replicates. 


Peptide LC-MS/MS analysis 

IVT mRNA was translated as above. After RNA digestion, translation 
products were immunoprecipitated using anti-Flag magnetic agarose 
beads (Pierce) overnight at 4 °C. Beads were washed twice in PBS, once 
in water, eluted in LDS PAGE buffer, and resolved on a NuPAGE 4-12%, 
Bis-Tris, 1.5 mm, Mini Protein Gel (NPO335BOX). The gel was stained 
with Coomassie dye and the region between about 60 kDa and 75 kDa 
(Precision Plus Protein All Blue Prestained Protein Standard; Bio-Rad) 
was excised and processed for mass spectrometry analysis as previously 
described”. In brief, the excised gel slice was cut into 1-mm pieces and 
placed in an 1.5-ml microtube. Coomassie staining was removed by 
incubating alternatively with a mixture of 25 mM ammonium bicar- 
bonate and acetonitrile (2:1) and 25 mM ammonium bicarbonate. Each 
15-min incubation at 37 °C was repeated until gel pieces were completely 
destained. Reduction and alkylation of cysteines was carried out by first 
incubating with a fresh 10 mM final concentration of dithiothreitol in 
25 mM ammonium bicarbonate at 60 °C for 60 min and then changing 
the solution to 60 mM final concentration of iodoacetamide in 25 mM 
ammonium bicarbonate and incubating for an addition 45 min at room 
temperature in the dark. After dehydrating the gel pieces with aceto- 
nitrile, trypsin solution (10 ng pl in 25 mM ammonium bicarbonate) 
or AspN (1 ng pl in 25 mM ammonium bicarbonate) was added until 
gel pieces were completely covered. Digestion was carried at 37 °C for 
16 h, 1,000 r.p.m. shaking. Proteases were inactivated by adding formic 
acid (trypsin digest) or TFA (AspN digest) to a final concentration of 1% 
(v/v). Peptides were extracted by sequential incubations with water/ 
acetonitrile/formic acid (50:49:1 and 80:19:1% (v/v)). Extracted pep- 
tides were pooled and dried to completion and resuspended in water/ 
acetonitrile (97:3% (v/v)) with 0.1% (v/v) TFA for mass spectrometry 
analysis. Mass spectrometry analysis was carried out once for trypsin 
digest and once for AspN digest. 


Mass spectrometry analysis 

In-gel digests were analysed using an Ultimate 3000 RSLC nanosystem 
(Thermo Scientific) coupled to an Orbitrap Eclipse mass spectrom- 
eter (Thermo Scientific). The sample was loaded onto the trapping 
column (Thermo Scientific, PepMap100, C18, 300 pm x 5 mm), using 
partial loop injection, for 3 min at a flow rate of 15 ul min ! with 0.1% 
(v/v) FA in 3% acetonitrile. Peptides were separated on the analytical 
column (Easy-Spray C18 75 um x 500 mm 2 um column) at a flow rate 
of 300 nl min”! using a gradient of 97% A (0.1% formic acid)/396 B (80% 
acetonitrile 0.1% formic acid) to 25% B over 50 min, then to 4096 B for 
an additional 6 min, and then to 90% B for another 2 min, remaining 
at 90% B for 12 min before the percentage of B was then lowered to 
3.896 to allow the column to re-equilibrate for 15 min before the next 
injection. Data were acquired using two field asymmetric ion mobil- 
ity spectrometry CVs (-50 V, -70 V). For each field asymmetric ion 
mobility spectrometry experiment (maximum cycle time of 1.5 s per 
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experiment), data were acquired in data-dependent mode and MS1 
consisted of a120,000 resolution full-scan MS scan (AGC set to 100% 
(4 x 10° ions) with a maximum fill time of 50 ms) using a mass range of 
380-1,500 m/z. The intensity MS2 trigger threshold was set to 5.0 x 105, 
andtoavoid repeated selection of peptides for MS/MS, the experiment 
useda40-s dynamic exclusion window. MS/MS was carried out on the 
Orbitrap using 30,000 resolution (AGC set to 100% (5 x 10* ions) with 
amaximum fill time of 54 ms). A higher-energy collisional dissocia- 
tion collision energy of 32% was used to fragment the peptides and an 
isolation window of 1.2 was used. 


Proteome Discoverer v2.5 analysis 

Raw data were imported and data were processed in Proteome Discov- 
erer (version2.5, Thermo Fisher Scientific). The raw files were submitted 
to a database search using Proteome Discoverer with SequestHF 
against the Oryctolagus cuniculus (rabbit) database containing pro- 
tein sequences from UniProt/Swiss-Prot (Proteome ID UPOOO001811), 
appended with Fluc (Uniprot ID PO8659), predicted +1 frameshifted 
polypeptides and common contaminant proteins (several types of 
human keratin, BSA and porcine trypsin). The spectrum identification 
was carried out with the following parameters: MS accuracy, 10 ppm; 
MS/MS accuracy of 0.02 Da; up to two missed cleavage sites allowed; 
carbamidomethylation of cysteine; and oxidation of methionine as vari- 
able modifications. An interactive workflow was used in the processing 
step. After the first Sequest HT search, the Inferis Rescoring node was 
used and spectra with confidence worse than high were resubmitted 
for a second Sequest HT search using additional dynamic modifications 
(N and Q deamidation; N-terminal pyroglutamate; methionine loss 
and acetylation). Peptides were assigned to their respective reading 
frame or junction by inspection. Percolator node was used for false 
discovery rate estimation and only rank 1 peptide identifications of 
high confidence (false discovery rate « 1%) were accepted. 


RNA-sequencing analysis 

RNA-sequencing (RNA-seq) libraries were prepared from 1 ug IVT mRNA 
using NextFlex Rapid Directional RNA-seq kit 2.0 (PerkinElmer), accord- 
ing to the manufacturer’s protocol. Libraries were amplified by six 
PCR cycles and purified by PAGE. Sequencing was carried out using an 
Illumina MiSeq at the Department of Biochemistry DNA sequencing 
facility, University of Cambridge (1 x 150 cycles V3). Reads were aligned 
with STAR (version 2.7.43)". Insertions and deletions per reference 
nucleotide were mapped from high-quality reads (QC score > 35) fil- 
tered for partial alignments and normalized to read depth. Insertion or 
deletion plots show the average mutation frequency for n = 3 replicated 
RNA-seq experiments. 


SDS-PAGE autoradiography 

IVT mRNAs were translated in nuclease-treated RRL (Promega) and 
products were co-translationally labelled as described above for 
30 min. 2x LDS PAGE buffer was mixed to each sample, which was 
heated to 70 °C for 10 min. Cooled samples were resolved on NUPAGE 
12%, Bis-Tris, 1.0 mm, Mini Protein Gels (Invitrogen NPO342BOX). The 
resolved gels were fixed in 10% methanol in acetic acid for 45 min, 
and dried at 80 °C for 2 husing a Fisher gel dryer system. Images were 
obtained by autoradiography using a Typhoon FLA 9000 and storage 
phosphor screens (GE Healthcare). 


Incorporated [S]methionine quantification 

IVT mRNAs were translated in nuclease-treated RRL (Promega) and 
products were co-translationally labelled for 2 h, as described above. 
[S]methionine incorporation was assayed according to the manu- 
facturer’s protocol. In brief, after RNA digestion, reactions were incu- 
bated for 10 minin 1M NaOH. Polypeptides were precipitated in 5% 
TCA, collected on Whatman glass fibre filters, and washed three times 
with 5% TCA and once with acetone. The dried filters were immersed 


in 2 ml EcoScint liquid scintillation cocktail (National Diagnos- 
tics) and counted in a Tri-Carb 4910 TR liquid scintillation counter 
(PerkinElmer). Incorporated [?S]methionine was determined from 
cpm of precipitated polypeptides per counts per minute of unwashed 
filters for each reaction (total counts per minute). 


Mouseimmunization 

C57BL/6) mice (female, WT) were purchased from Charles Rivers Labo- 
ratories. Mice of 8-12 weeks old were intramuscularly injected with 
three doses of 10 ug BNT162b2, 5 x 10’ infectious units of ChAdOx1 
nCoV-19 or untreated. For booster immunizations, the same dose of 
therespective vaccine was injected 3 weeks and 6 weeks apart into the 
samesiteasthe primary immunization. Spleens were obtained at day 
8post-third dose and cell suspensions were prepared. In brief, spleens 
were mashed with a syringe plunge and filtered through 70-um cell 
strainers. Red blood cells were lysed with RBC lysing buffer (155 mM 
NH, CI, 12 mM NaHCO,, 0.1 mM EDTA), before counting and cryo- 
preserving before ELISpot assays. Mice were not randomly assigned 
to groups and experimenters were not blinded to experiments. 


IFNy ELISpot 

Human IFNy ELISpot assays were carried out as previously described 
using the human IFNy ELISpot PLUS kit (ALP; MabTech 3420-4APT)*5. 
Overlapping peptide pools corresponding to: in-frame SARS-CoV-2 
spike protein (spike, 158 peptides); spike protein S1 and S2 regions 
(S1 + S2), which were described previously’; SARS-CoV-2 M protein, 
which was described previously*®; and peptides predicted to occur by 
translation of the BNT162b2 mRNA +1 frame exclusively (+1FS spike, 
123 peptides) were used. Peptides were obtained from Mimotopes, 
andare listed in Extended Data Table 3. Cryopreserved PBMCs were 
thawed in RPMI1640 medium supplemented with 1% (v/v) penicillin- 
streptomycin (Sigma), containing 0.0196 (v/v) Benzonase nuclease 
(Merck). PBMCs were washed and then incubated for 1-2 h at 37 °C, 
5% CO, in RPMI1640 medium, 10% (v/v) human AB serum and 1% (v/v) 
penicillin-streptomycin. Pre-coated IFNy ELISpot 96-well plates 
(MabTech 3420-4APT-2) were washed three times with PBS and then 
blocked with RPMI1640 medium, 10% (v/v) human AB serum and 1% 
(v/v) penicillin-streptomycin for 45 min. Overlapping peptide pools 
were plated at 4 ug mI, 50 ul per well; dimethylsulfoxide (Sigma) was 
used as the negative control at the equivalent concentration to the 
peptides. A total of 200,000 cells in 50 ul were added and incubated 
for 18-24 h. Cells were discarded, and plates were washed with PBS 
0.05% (v/v) Tween (Sigma), and incubated with IFNy detector antibody 
(clone 7-B6-1,1 ug mI?) for 2-4 h at room temperature. Washed plates 
werethen incubated with streptavidin alkaline phosphatase antibody 
(1 pg mI?) for 1-2 h. Plates were washed and then colour development 
was carried out using 1-step NBT/BCIP substrate solution. A 50 ul volume 
of filtered NBT/BCIP was added to each well for 5 min at room tempera- 
tureafter which development was stopped with cold water. Plates were 
dried at room temperature for approximately 48 h. Spots were quanti- 
fied using an AID iSpot Spectrum EliSpot Reader (AID EliSpot Software 
version 7.0, Autoimmun Diagnostika). Average spot count value inthe 
background wells was subtracted from that of the test wells and values 
were expressed as SFUs per million cells. Mouse IFNy ELISpot assays 
were carried out using cryopreserved splenocytes thawed as above and 
incubated in RPMI1640 medium with 10% FBS alone. Peptide stimula- 
tions and downstream processing were as above, using pre-coated 
mouse IFNy ELISpot PLUS kit (ALP; MabTech 3321-4APT-2). Figure 2b,c 
shows ELISpot responses for n= 8 mice per group. Figure 2d,e shows 
ELISpot responses for n = 20 ChAdOx1 nCoV-19-vaccinated individuals 
and n = 21 BNT162b2-vaccinated individuals. 


HLA genotyping 
HLA genotyping was conducted for n = 40 donors by Histogenetics LLC 
(Ossining, New York). HLA data (Extended Data Table 1) were filtered 


to HLA-A, HLA-B and HLA-C genes and were truncated to allele group 
level. Donor genotypes for the BNT162b2-vaccinated individuals were 
visualized in a presence-absence heatmap in R (version 4.3.0) using 
ggplot2 (version 3.4.2). Major allele group information was summarized 
and individual allele group frequencies were calculated to illustrate 
the overall genetic composition. 


Statistical analysis 
Statistical analyses were carried out in base R (version 4.3.0) or using 
the DescTools R package (version 0.99.46). 


Ethics statement 

Animal experiments were licensed by the UK Home Office according 
to the Animals Scientific Procedures Act 1986 (License PP6047951), 
and approved and conducted in compliance with protocols by the Uni- 
versity of Cambridge, University Biomedical Services Animal Welfare 
and Ethical Review Bodies committee. Human sample collection and 
analysis was conducted in accordance with the principles of good clini- 
cal practice and following approved protocols of the NIHR National Bio- 
Resource. Samples were collected with the written informed consent of 
all study participants under the NIHR National BioResource-Research 
Tissue Bank ethics (research ethics committee (REC): 17/EE/0025) and 
fromthe PITCH study. PITCH is a substudy of the SIREN study, which was 
approved by the Berkshire REC, Health Research 250 Authority (IRAS 
ID 284460, REC reference 20/SC/0230), with PITCH recognized as a 
substudy on 2 December 2020. SIREN is registered with ISRCTN (Trial 
ID: 252 ISRCTN11041050). Some participants were recruited under 
aligned study protocols. In Liverpool, some participants were recruited 
under the study Human immune responses to acute virus infections 
(16/NW/0170), approved by North West - Liverpool Central REC on 8 
March 2016, and amended on 14 September 2020 and 4 May 2021. In 
Oxford, participants were recruited under the GI Biobank Study 16/ 
YH/0247, approved by the REC at Yorkshire & The Humber - Sheffield 
REC on 29 July 2016, which was amended for this purpose on 8June 2020. 
The study was conducted in compliance with all relevant ethical regula- 
tions for work with human participants, and according to the principles 
of the Declaration of Helsinki (2008) and the International Conference 
on Harmonization Good Clinical Practice guidelines. Written informed 
consent to publish clinical and genetic data, as well as for study partici- 
pation, was obtained for all participants enrolled in the study. 


Reporting summary 
Further information on research design is available in the Nature 
Portfolio Reporting Summary linked to this article. 


Data availability 


Mass spectrometry data have been deposited with MassIVE ID 
MSV000093074. RNA-seq reads and processed files are available at 


the NCBI Gene Expression Omnibus (accession GSE223044). Addi- 
tional data are available from figshare (https://doi.org/10.6084/ 
m9.figshare.24271744). The following accessions were used for mass 
spectrometry analysis: UPOOOOOISI1 and PO8659 (UniProt). Source 
data are provided with this paper. 


Code availability 
Scripts for processing alignments are available from GitHub”. 
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Extended Data Fig. 1| Validation of WT Fluc, Fluc + 1FS, and Fluc-1FS c, Relative luciferase expression produced by translation of WTFluc and 
mRNAs. a, UV photograph of WTFlucandFluc-1FS mRNA transcripts analysed Fluc+1FS mRNAs. Data were obtained from n =1(non-replicated) analyses. 
by agarose gel electrophoresis. b, Immunoblot of luciferase protein produced For gel source data, see Supplementary Fig. 6. 

by translation of WTFluc and Fluc+1FS mRNAs using anti-FLAG antibody. 
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Extended Data Fig. 2| Visualisation of HLA genotypes in BNT162b2- peptide pool; (ii) donor identifier; (iii) ELISpot count if >40. SFU/million. 
vaccinated individuals. Major allele groups in HLA-A, -B and -C genes for Allele group frequency indicated by black tile, vertical dashed lines: HLA genes, 


n=40 donors. The Sample ID contains 3 fields separated by underline: (i) 1or 0, horizontal dashed line: indicates the cut-off (ELISpot count >40). 
with 1 denoting an ELISpot count >40 following stimulation with the +1 FS 
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Query: SARS-CoV-2 S gene +1 frame translation 
Sbjct: BNT162b2 mRNA +1 frame translation 


Query 34 VFITLTKFSDPQOFYIQLRTCSYLSFPMLLGSMLYMSLGPMVLRGLITLSYHLMMVFILLP 93 
T T+ SDP RTCS LS Gs PM R T L 
Sbjct 50 ACTTPTRCSDPACCTLPRTCSCLSS-ATPGSTPSTCPAPMAPRDSTTPCCPSTTGCTLPA 108 


Query 94 LRSLT--EAGFLV--LLIRRPSPYLLLITLLMLLLKSVNFNFVMIHFWVFITTKTTKVGW 149 
RS T EAG R P T K ++ W TT+TT+ GW 
Sbjct 109 PRSPTSSEAGSSAPHWTAR---PRACSTTPPTWSSKCASSSSATTPSWASTTTRTTRAGW 165 


Query 150 KVSSEFILVRIIALLNMSLSLFLWTLKENRVISKILGNLCLRILMVILKYILSTRLLI-C 208 
K SS A + SL WT K +R Sł + CLR + ST L c 
Sbjct 166 KASSGCTAAPTTAPSSTCPSLS-WTWKASRATSRTCASSCLRTSTATSRSTASTPLSTSC 224 


Query 209 VISLRVFRL--NHWIC-QVLTSLGFKLYLLYIEVILLVILLOVGOLVLOLIMWVIFNLGL 265 
I LR L WIC TS GF+ E LI GOLV L MW +L 
Sbjct 225 GICLRASLLWNPWWICPSASTSPGFRHCWPCTEATHLAIAAADGOLVPPLTMWATCSLEP 284 


Query 266 FYNIMKMEPLOMLTVHLTLSOKOSVRNPS-LKKESIKLLTLESNOONLLLDFLILOTCAL 324 
#7 SA UP +L+ +0S +PS K+ S+ T + N I CA 
Sbjct 285 SCSTTRTAPSPTPWIVLWILARQSAPSPSPWKRASTRPATSGCSPPNPSCGSPISPICAP 344 


Query 325 LVKFLTPPDLHLFMLGTGRESATVLLIILSYIIPHHFPLLSVMECLLLNMISALLMSMQI 384 
+ PPD L GTG SA P S CLH+H A 
Sbjct 345 SARCSMPPDSPLCTPGTGSGSAIAWPTTPCCTTPPASAPSSATACPLPSTTCASOTCTPT 404 


Query 385 HLLEVMKSDKSLOGKLERLLIIIINYOMILOAALLGILTILILRLVVIIITCIDCLGSLI 444 
MK + BR ek ALG T * ITC CGS I 
Sbjct 405 ASSGEMKCGRLPLDROARSPTTTTSCPTTSPAVLPGTATTWTPKSAATTITCTGCSGSPI 464 


Query 445 SNLLREIFOLKSIRPVAHLVMVLKVLIVTFLYNHMVSNPLMVLVTNHTEYFLL-NFYMHO 503 
S *SIRP A LV K T * *PM + TE++ N M 
Sbjct 465 SPSS-GTSPPRSIRPAAPLVTAWKASTATSHCSPTAFSPOMAWAISPTEWWCASNCCMPL 523 


Query 504 QLFVDLKSLLIWLKTNVSISTSMVQAQVFLLSLTKSFCLSNNLAE-TLLTLLMLSVIHRH 562 
L+ I +TN STS A T+S C SLA + + L IRH 
Sbjct 524 POCAALRKAPI-SRTNA-TSTSTAPAPACOR-ATRSSCHSSSLAGISPIPOTPL-EIPRH 579 


Query 563 LRFLT--LHHVLLVVSVLHQEQILLTRLLFFIRMLTAQKSLLLFMQINLLLLGVFILQVL 620 
a OTS NTs +S L + R R T K F I+L G 
Sbjct 580 WKSWTSPLAASAECLSPLAPTPAI--RWOCCTRT-TVPKCPWPFTPISHLHGGC-TPPAA 635 


Query 621 MFFKHVQAVGLNMSTTHMSVTYPLVOVYALVIRLRLILLG-GHVVLVNPSLPTLCHLVOK 679 
M F+ AV +T STP A RR LG SLPT C + 
Sbjct 636 MCFRPEPAVSEPSTTIATSATSPSALESAPATRHRQTALGEPEAWPARASLPTQCLWAPR 695 


Query 680 IOLLTLITLLPYPOILLLVLPOKFYOCLPRHO-IVOCTFVVIOLNAAIFCCNMAVFVHN- 737 
T TL P PQ+ C PR cT I +A CC+ A + 
Sbjct 696 TAWPTPTTLSLSPPTSPSA-PORSCLCPPRPAWTAPCTSAAIPPSAPTCCCSTAASAPSI 754 


Query 738 TVLLELLNKTKTPKKFLHKSNKFTKHHOLKILVVLIFHKYYOIHONOARGHLLKIYFSTK 797 
N+T+TPK+ KS + T+ + I ++ I +AG + STK 
Sbjct 755 EPOGSPWNRTRTPKRCSPKS-RSTRPLLSRTSAASISARFCPILASPASGASSRTCCSTK 813 


Query 798 HLOMLASSNNMVIALVILLLETSFVHKSLTALLFCHLCSQMKLLNTLLHC-RVQSLLVGP 856 
H ASS++M I L F +SLT CIC M+ +TLC Qs G 
Sbjct 814 HWPTPASSSSMAIVWATLPPGI-FAPRSLTDQC-CLLC-PMRSPSTHLPCWPAQSQAAGH 870 


Query 857 LVOVLHYKYHLLCKWLIGLMVLELHRMFSMRTKNLPTNLIVLLAKFKTHFLPOOVHLENF 916 
LQ + LLCHW G E RM RT++ PT+ A+ +T QQ E+ 
Sbjct 871 LEQAPLCRSPLLCRWPTGSTASEP-RMCCTRTRSSPTSSTAPSARSRTAAA-QQAPWESC 928 


Query 917 KMWSTKMHKLTRLLNNLAPILVQFQVFMISFHVLTKLRLKCKLIGSQADFKVCRHMLNNL 976 
+ WST+M + T ++ P + IS T LR +C+ SQAD + RH ++ 


Sbjct 929 RTWSTRMPRHTPWSSSCPPTSAPSALCTISAD-WTLLRPRCRSTDSQADCRASRHTPSSS 987 


Query 977 ELQKSELLLILLLLKCQSVYLDNQKELIFVERAIILCPSLSQHLMVSS-CMLMSLHKKRT 1035 
E. E LOI *C SV *E F RA SLS L C M K+R 
Sbjct 988 EPPRLEPLPIWPPPRCLSVCWARAREWTFAARATTA--SLSLPLTAWCFCTHMCPLKRRI 1045 


Query 1036 SOQLLLPFVMMEKHTFLVKVSLFOMAHTGLHKGIFMNHKSLLOTTHLCLVTVMLELSTTOF 1095 
S LP K TFL K A GHG + +S TT CL T ak 
Sbjct 1046 SPPLOPSATTAKPTFLEKACSCPTAPIGSHSGTSTSPRSSPPTTPSCLATATSSALTIPC 1105 


Query 1096 MILCNLN-THSRRS-INILRITHHOMLIVTSLALMLOLTFKKKLTASMRLPRIMNLSSIS 1153 
LC+ + T S+R+ + LR A + + 4K + R PRI S+ 
Sbjct 1106 TTLCSPSWTASKRNWTSTLRTTQAPTWTWAISAESMPASTSRKRSTGTRWPRITRASTC- 1164 


Query 1154 KNL 1156 
KN 
Sbjct 1165 KNW 1167 


Extended Data Fig. 4 | NCBI BLASTP alignment of + 1 translated products. Protein BLAST alignment of polypeptides predicted by +1 frame translation of either 
BNT162b2 mRNA or Wuhan SARS-CoV-2 Spike mRNA (from NC_045512.2). 
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Extended Data Fig. 5| Translation of 1-methylW-modified Fluc mRNA 
produces multiple polypeptides. a, Diagram of NFLAG-WTFluc mRNA. 

b, Western blot analysis (anti-FLAG epitope) of polypeptides produced by 
translation of unmodified, or 1-methylW, NFLAG-WTFluc mRNA. In-frame 
firefly luciferase is indicated by arrow. Low molecular weight polypeptides are 
indicated by asterisk (*). A single blot from n =3 replicated experiments is 
displayed. For gel source data, see Supplementary Fig. 7. 
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AGAAGCGmUGGCCAGCCAGAGCAmUCAmUmUGCCmUACACAAmU/ GmU CmU CmU/GGGCGCCGAGAACAGCGmUGGCCmUACmU/CCAACAACmU CmU/ 
AmUCGCmU/AmUCCCCACCAACmUmU/CACCAmUCAGCGmUGACCACAGAGAmUCCmUGCCmUGmUGmUCCAmUGACCAAGACCAGCGmUGGACmU 
GCACCAmUGmUACAmUCmUGCGGCGAmUmU/.CCACCGAGmUGCmU/CCAACCmUGCmUGCmU/GCAGmU ACGGCAGCmUmU/CmUGCACCCAGCmUGA 
AmU/AGAGCCCmUGACAGGGAmUCGCCGmUGGAACAGGACAAGAACACCCAAGAGGmUGmUmU/CGCCCAAGmUGAAGCAGAmUCmUACAAGACCCCm 
UccmU/AmUCAAGGACmUmUCGGCGGCmUmUCAAmUmUmUCAGCCAGAmUmU CmUGCCCGAmU CCmU/AGCAAGCCCAGCAAGCGGAGCmUmUCA 
mUccAGGACCmUGCmUGmUmU/CAACAAAGmUGACACmUGGCCGACGCCGGCmUmUCAmUCAAGCAGmU/AAmUGGCGAmUmU/GmU CmU/GGGCGACA 
mUmUGccGcccAcGGAmUCmUGAmUmUmUGCGCCCAGAAGmUmUmU/AACGGACmUGACAGmUGCmUGCCmU CCmUCmU/GCmU/GACCGAmUGAGA 
mUGAmUCGCCCAGmUACACAmUCmUGCCCmUGCmUGGCCGGCACAAmUCACAAGCGGCmUGGACAmUmUmU/GGAGCAGGCGCCGCmUCmUGCAGA 
mUccccmUmUmUGCmU/AmU/GCAGAmUGGCCmU/ACCGGmUmU/CAACGGCAmU CGGAGmUUGACCCAGAAmUGmUGCmU GmW ACGAGAACCAGAAGC 
mUGAmUCGCCAACCAGmUmU/CAACAGCGCCAmUCGGCAAGAmUCCAGGACAGCCmUGAGCAGCACAGCAAGCGCCCmUGGGAAAGCmUGCAGGACGm 
UGGmUCAACCAGAAmUGCCCAGGCACmUGAACACCCmUGGmUCAAGCAGCmUGmUCCmUCCAACmUmUCGGCGCCAmUCAGCmUCmUGmUGCmU 
GAACGAmUAmUCCmUGAGCAGACmUGGACCCmUCCmUGAGGCCGAGGmUGCAGAmUCGACAGACmUGAmUCACAGGCAGACmUGCAGAGCCmUCC 
AGACAmUACGmUGACCCAGCAGCmUGAmUCAGAGCCGCCGAGAmUmUAGAGCCmUCmUGCCAAmU CmU GGCCGCCACCAAGAmUGmUCmUGAGmU 
GmU/GmUGCmU/GGGCCAGAGCAAGAGAGmUGGAC mUmU/mmU/mU/GCGGCAAGGGCmU/ACCACCmUGAmUGAGCmUmU cccmUCAGmUCmUGCCCCm 
UcACGGCGmUGGmUGmUmUmU/cmUGCACGmU/GACAmUAmU/GmU/GCCCGCmU/CAAGAGAAGAAmU mU mmU/CACCACCGCmU/CCAGCCAmU CmU GC 
CACGACGGCAAAGCCCACmUmUmUcCmUAGAGAAGGCGmUGmUmUCGmUGmU/CCAACGGCACCCAmUmUGGmU/mmUCGmU/GACACAGCGGAACmUJ 
mU/CmUACGAGCCCCAGAmUCAmUCACCACCGACAACACCmUmUCGmU/GmU/ CmUGGCAACmUGCGACGmU CGmUGAmU/ CGGCAmUmUGmU/GAACA 
AmUACCGmUGmUACGACCCmUCmUGCAGCCCGAGCmU/GGACAGCmUmU/CAAAGAGGAACmUGGACAAGmU/ACmUmU/mU/AAGAACCACACAAGCCC 
CGACGmUGGACCmUGGGCGAmUAmUCAGCGGAAmUCAAmUGCCAGCGmU/CGmUGAACAmUCCAGAAAGAGAmUCGACCGGCmUGAACGAGGmUGG 
CCAAGAAmUCmUGAACGAGAGCCmUGAmUCGACCmUGCAAGAACmUGGGGAAGmUACGAGCAGmU ACAmUCAAGmUGGCCCmUGGmU ACAmU CmUJ 
GGCmUGGGCmUmUmUAmUCGCCGGACmUGAmUmU/GCCAmU/CGmUGAmUGGmU CACAAmUCAmUGCmU/GmUGmU/mU/GCAmUGACCAGCmUGCm 
UcmUAGCmUGCCmU/GAAGGGCmUGmUmU/GmU/AGCmUGmUGGCAGCmUGCmU/GCAAGmUmU CGACGAGGACGAmUmU CcmUGAGCCCGmUGCmUG 
AAGGGCGmUGAAACmUGCACmUACACAmUGA 


Ribosome slippery site 


Extended Data Fig. 6| Annotated BNT162b2 Spike mRNA CDS putative (N)I-methylpseudouridine and X is (N)1-methylpseudouridine or cytidine in 
slippery sites. Putative ribosome slippery sites in BNT162b2 which were thefirst nucleotide ofthe immediate downstream codon. 
identified by the following formula: mIUm1UmIU X, where m1U is 
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Extended Data Table 1| LC-MS/MS analysis of Fluc + 1FS high-molecular weight polypeptide 


[Confiden 
Ice by 
Search |Percolator q- Percolator PEP |XCorr by 
Number |NumberlNumber of Engine alue by Search|by Search Search |Top 
Group [Confiden Modificat|Qvality |Qvality q- |of lof Missed [Theo MHplus |Sequest |Engine Sequest |Engine SequestiEngine [Apex RT 
ID. [Checked |ce Sequence ions PEP value Proteins |PSMs |Cleavages |n Da HT HT HT |jSequest HT |in min |Frame 
IGPAPFYPLEDGTAGEQL 0.00014) 0.0008289| 
9935] TRUE |High HK 3] 7| 2 0| 2026.98688High 0.000708 5.42E-05| 2.02,  39.85|n frame 
0.00627] 0.0008289| 
33451] TRUE High |YGLNTNHR 1 7| 2 0|  974.48025High 0.000708 0.003422! 1.54) 17.78|in frame 
1xOxidat| 0.00379| 0.0008289| 
6239| TRUE High ELLNSMGISQPTVVFVSK]jon [M6] | 2 7 2 0| 1965.03614/High 0.000708 0.001972) 1.62] 46.54jin frame 
0.10602! 0.0098011 
12509| TRUE High KLPIIQK 2 8 3 839.5713High 0.00738: 0.07696} 1.43 — 25.88jin frame 
xOxidat| 0.05249| 0.0036367| 
15467| TRUE |High IIMDSK ion [M4] | 9 2 0|  835.45937High 0.00283; 0.03515) 1.72| 23.99in frame 
xOxidat| 0.00100) 0.0008289| 
29537| TRUE High TIALIMNSSGSTGLPK ion [M6] | 1 7| 0| 1605.85163High 0.000708 0.000459 2.15 38.1|in frame 
0.02787| 0.0008289] 
7693 TRUE High FEEELFLR 1 7| 0| 1082.55169High 0.000708 0.01757, 2.31 A3.71j41FS 
0.0008289 
33224] TRUE High |YOLSNLHEIASGGAPLSK] 9.19E-06| 7 0| 1871.94977High 0.000708 2.68E-06| 2.98 41.34j41FS 
0.01723| 0.0008289 
7351| TRUE High IEVGEAVAK 1 7 0| 802.43051|High 0.000708 0.01038} 0.94 17.65)+1FS 
JaGYGLTETTSAILITPEG 0.02150} 0.0008289| 
23417| TRUE High DDKPGAVGK 9 7 0| 2718.38323High 0.000708 0.01322) 1.04 — 42.98)+1FS 
0.09178} 0.0090845| 
32478| TRUE High IVPFFEAK 2 6| 0|  936.51893High 0.00689; 0.06561 1.62 — 39.85j+1FS 
0.02057} 0.0008289] 
29665| TRUE |High |ILGVNQR 6 7| 0| 787.44208High 0.000708 0.01259 1.05  19.53|HTFS 
xOxidat 
IGPMIMSGYVNNPEATNA [ion [M3; 0.0008289} 
9994] TRUE High LIDK M5] 4.34E-07| 7 0| 2267.06824High 0.000708 9.50E-08| 3.74  39.39|HTFS 
0.00010) 0.0008289 
6401| TRUE High IEIVDYVASQVTTAK 9 7 0| 1523.79516High 0.000708 4.02E-05| 2.52  41.76|H1FS 
0.00353| 0.0008289| 
9405| TRUE High IGGVVFVDEVPK 5] 7| ol 1145.6201|High 0.000708 0.001817] 2.87, 38.97j41FS 
Iryptic peptides identified by purification and LC-MS/MS analysis of the high molecular weight product produced by 1-methylY4 Fluc+1FS mRNA translation. 


Extended Data Table 2 | Insertion and Deletion Frequencies in unmodified and 1-methylU mRNAs 


a i Mean S.D. 

ample grou 

eee Deletions Insertions Deletions Insertions | 
unmodified 0.000862] 0.000832 0.000118 0.000079] 
1-methyl'P 0.000625 0.000848 0.000080 0.000263 
P-value 0.05297 0.9311 


Comparison of single nucleotide deletion frequency or insertion frequency, normalised to total reads (%), for unmodified or 1-methylY mRNA (Welch's unpaired two-tailed T-test). 
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Extended Data Table 3 | Spike and + TFS Spike peptides 


Pool Sequence equence equence 

IMFVFLVLLPLVSSQCVNL IPFAMQMAYRFNGIGV pike [Spike RDPQTLEILDITPCSF |FTFS spike |AWPTTPCCTTPPASAP 

PLVSSQCVNLTTRTQL 'AYRFNGIGVTQNVLY i i ILDITPCSFGGVSVI TFS spike |CTTPPASAPSSATACPL 

T |GGV: [ETFS spike [TTC/ 
IV ANGFNSA N FS spike A 

[TNSFTRGVYYPDKVFR TANQFNSAIGKIQUSL |AVLYQDVNCTEVPVAI [ETFS spike [.PLDRQARSPTTTTSCPT 
Spike |51 |GVYYPDKVFRSSVLHSTQ AIGKIQUSLSSTASAL |CTEVPVAIHADQLTPTWR [ETFS spike SPTTTTSCPTTSPAV 
Spike |51 [FRSSVLHSTQDLFLPFF SLSSTASALGKLQDVV [Spike HADQLTPTWRVYSTGSNV [ETFS spike [.PGTATTWTPKSAATTI 
Spike |51 [STQDLFLPFFSNVTWF ALGKLQDVVNQNAQAL pike Spike RVYSTGSNVFQTRAGCL TFS spike TPKSAATTITCTGCSGSP. 
Spike |51 LPFFSNVTWFHAIHV VVNQNAQALNTLVKQL pike [Spike NVFQTRAGCLIGAEHV FIFS spike AATTITCTGCSGSPI 
Spike {S1 NVTWFHAIHVSGTNGTKR [Spike (2 |DALNTLVKQLSSNFGA] pike [Spike |CLIGAEHVNNSYECDIPT FIFS spike [SPSSGTSPPRSIRPAAPL 
Spike |5! HVSGTNGTKRFDNPVLPF [Spike [52 KQLSSNFGAISSVLNDIL [Spike [Spike INNSYECDIPIGAGICASY TFS spike PRSIRPAAPLVTAWK 
Spike |5! KRFDNPVLPFNDGVYF ISSVLNDILSRLDKV Spike [Spike PIGAGICASYQTQTNSPR TFS spike PLVTAWKASTATSHCSPT 
Spike |51 VLPFNDGVYFASTEKSNI DILSRLDKVEAEVQIDR — [Spike [Spike ISYQTQTNSPRRARSV TFS spike STATSHCSPTAFSPOMAW. 
Spike |51 YFASTEKSNIIRGWIF VEAEVQIDRLITGRL Spike Spike IPRRARSVASQSIIAYTM FIFS spike [PTAFSPQMAWAISPTEWW 
Spike |51 KSNIIRGWIFGTTLDSK IDRLITGRLOSLQTYV Spike pike QSIIAYTMSLGAENSV IFTFS spike ASNCCMPLPQCAALRK 
Spike |51 |WIFGTTLDSKTQSLLIV RLOSLQTYVTQQLIR pike pike MSLGAENSVAYSNNSIAT [ETFS spike [CMPLPQCAALRKAPI 
Spike |51 DSKTQOSLLIVNNATNVVI Spike [Spike /AYSNNSIAIPTNFTISV [FIF spike [JRATRSSCHSSSLAGI 
Spike |5i |VNNATNVVIKVCEFQF Spike [Spike |AIPTNFTISVITEILPV |FTFS spike HSSSLAGISPIPQTPLEI 
Spike |51 |VVIKVCEFOFCNDPFLGV EIRASANLAATKM Spike [Spike [SVTTEILPVSMTKTSV [IFS spike [SPIPQTPLEIPRAWKSW 
Spike [51 |aFCNDPFLGVYYHKNNK pike [52 |ASANLAATKMSECVL Spike [Spike |PVSMTKTSVDCTMYI TFS spike PLEIPRHWKSWTSPL 
Spike |5! LGVYYHKNNKSWMESEFR Spike _[S2__AATKMSECVLGOSKRVDF [Spike [Spike DCTMYICGDSTECSNLLL |FTFS spike WKSWTSPLAASAECL 
Spike (81 NKSWMESEFRVYSSANNC Spike [52 |VLGQSKRVDFCGKGYHLM [Spike [Spike IDSTECSNLLLQYGSF [FIF spike [SPLAPTPAIRWQCCTR 
Epike |51 FRVYSSANNCTFEYV Epike B2 |DFCGKGYALMSFPQSAPH [Spike [Spike LLQYGSFCTQLNRALTGI [HIFS spike [TVPKCPWPFTPT 
Spike ($1 |[SANNCTFEYVSQPFLMDL [Spike [52 |MSFPQSAPHGVVFLHV [Spike [Spike QLNRALTGIAVEQDK TFS spike |HLHGGCTPPAAMCFR 
Spike [S1_|YVSQPFLMDLEGKOGNFK [pike [S2__APHGVVFLHVTYVPAQEK pike Spike [GIAVEQDKNTQEVFAQVK TFS spike |CTPPAAMCFRPEPAV 
Spike [5i DLEGKQGNFKNLREFVFK [Spike [52 |HVTYVPAQEKNFITAPAI [Spike pike INTQEVFAQVKQIYKTPPI [ETFS spike |TIATSATSPSALESAPAT. 
Spike |5i |FKNLREFVFKNIDGYFKI Spike [52 |EKNFTTAPAICHDGKAHF [Spike pike KQIYKTPPIKDFGGFNF [ETFS spike |PSALESAPATRHRQTAL 
Spike [51 |FKNIDGYFKIYSKHTPI Spike 52  AICHDGKAHFPREGVFV Spike [Spike PPIKDFGGFNFSQIL TFS spike |ATRARQTALGEPEAWPAR 
Spike |51 |FKIYSKHTPINLVRDL Spike 2  JAHFPREGVFVSNGTHWFV Spike [Spike INFSQILPDPSKPSKRSFI [ETFS spike |LGEPEAWPARASLPTOCL 
Spike |51 HTPINLVRDLPQGFSAL [Spike [52 SNGTHWFVTQRNFY [spike Spike IPSKPSKRSFIEDLLFNKV [FIFS spike /ARASLPTQCLWAPRTAW 
Spike ($1 RDLPQGFSALEPLVDLPI pike [S2__HWFVTORNFYEPQI Spike [Spike FIEDLLFNKVTLADAGFI FIFS spike [ELWAPRTAWPTPTTLSL 
Spike |51 |ALEPLVDLPIGINITRF Spike B2 |QRNFYEPQITTDNTFV pike [Spike KVTLADAGFIKQYGDCL TFS spike WPTPTTLSLSPPTSPSA 
Spike (81 LPIGINITRFQTLLALHR pike [52 |QlITTDNTFVSGNCDVVI [Spike [Spike |FIKQYGDCLGDIAARDLI TFS spike |PARSCLCP 
Spike (81 RFQTLLALHRSYLTPGDS Spike {S2 |FVSGNCDVVIGIVNNTVY [Spike [Spike LGDIAARDLICAQKFNGL FIFS spike PRPAWTAPCTSAAIPPSA 
Spike (81 HRSYLTPGDSSSGWTAGA Spike |52 |VIGIVNNTVYDPLQPEL [Spike [Spike LICAQKFNGLTVLPPLL |FTES spike |CTSAAIPPSAPTCCCSTA 
Spike (81 DSSSGWTAGAAAYYVGYL pike [52 'YDPLQPELDSFKEEL [Spike [Spike |GLTVLPPLLTDEMIAQY |FTES spike |PSAPTCCCSTAASAP 
Bpike_[S1_|GAAAYYVGYLOPRTFLLK [pike [52  |PELDSFKEELDKYEK Spike Peke LTDEMIAQYTSALLAGTI [IFS spike |Q@GSPWNRTRTPKRCSPK 
Spike |S1_|YLOPRTFLLKYNENGTI Spike [52 |FKEELDKYFKNHTSPDV [Spike pike TSALLAGTITSGWTF FIFS spike [SRSTRPLLSRTSAASI 
Spike {S1 LLKYNENGTITDAVDCAL pike _[S2__|YFKNHTSPDVDLGDISGI [Spike Spike _[TITSGWTFGAGAALQIPF |FTES spike [SRTSAASISARFCPIL 
Spike |S1_|TITDAVDCALDPLSETK Spike [S2__[DVDLGDISGINASVVNI___[Spike [Spike IGAGAALQIPFAMQMAYRF FIFS spike |SARFCPILASPASGASSR 
Bpike_[S1_|CALDPLSETKCTLKSFIV pike [S2__SGINASVVNIQKEIDRL — pike [Spike IPFAMQMAYRFNGIGV FIFS spike |ASPASGASSRTCCSTK 
Spike |51 |TKCTLKSFTVEKGIY Epike [52 |VNIQKEIDRLNEVAKNL — [bike [Spike |RENGIGVTQNVLYENQKL FIFS spike |IWPTPASSSSMAIVWATL 
Spike |51 |KSFTVEKGIYQTSNFRV [bike |52 |DRLNEVAKNLNESLIDL [Spike Spike |ONVLYENQKLIANGF FIFS spike [SSSMAIVWATLPPGI 
Spike [1 |GIYGaTSNFRVQPTESIVR Spike [52 |KNLNESLIDLQELGKY Spike [Spike KLIANQFNSAIGKIQDSL TFS spike [SPSTHLPCWPAQSQAA 
Spike (81 RVQPTESIVRFPNITNL pike [52 |LIDLQELGKYEQYIKWPW [Spike [Spike ISAIGKIQDSLSSTASAL TFS spike WPAQSQAAGHLEQAPLCR 
Spike (81 |VRFPNITNLCPFGEVF pike [S2__KYEQYIKWPWYIWLGFI [Spike [Spike ISLSSTASALGKLQDVV TFS spike |GHLEQAPLCRSPLLCRW 
Spike |5! [INLCPFGEVFNATRFASV [Spike [S2__\WPWYIWLGFIAGLIAIVM _ Spike Spike LGKLODVVNQNAQALNTL TFS spike |CRSPLLCRWPTGSTA 
Spike |5! VFNATRFASVYAWNRKRI |Spike |52 |FIAGLIAIVMVTIMLCCM pike pike INQNAQALNTLVKQLSSNF FIFS spike [PRMCCTRTR 
Spike {51 [SVYAWNRKRISNCVADY Spike [52 |VMVTIMLCCMTSCCSCLK [Spike pike |ILVKQLSSNFGAISSVL [ETFS spike [SPTSSTAPSARSR 
Spike |51 KRISNCVADYSVLYNSAS [Spike [52 |CMTSCCSCLKGCCSCGSC Spike [Spike INFGAISSVLNDILSRL [+1FS spike AAQQAPWESCRTWSTRM 
Bpike_[S1_[DYSVLYNSASFSTFKCY _ pike B2  LKGCCSCGSCCKFDEDDS Spike Spike | NDILSRLDPPEAEVQI [ETFS spike APWESCRTWSTRMPR 
Spike |51 |[SASFSTFKCYGVSPTKL Spike |52 |SCCKFDEDDSEPVLKGVK [Spike [Spike |DPPEAEVQIDRLITGRL [FIFS spike [TPWSSSCPPTSAPSAL 
Spike |5! KCYGVSPTKLNDLCFTNV [Spike |52 |FDEDDSEPVLKGVKLHYT [Spike [Spike |DRLITGRLOSLQTYV FIFS spike ADWTLLRPRCRST 
Spike |51 KLNDLCFTNVYADSFVIR Spike [Spike MFVFLVLLPLVSSQCVNL [Spike [Spike LOSLQTYVTQQLIRAAEI FIFS spike [SQADCRASRHT 
Spike [5i NVYADSFVIRGDEVRQ! [bike SpkePLVSSQCVNLITRTQL pike [Spike QOLIRAAEIRASANL [IFS spike [SEPPRLEPLPIWPPPRCL 
Spike _[S1_|VIRGDEVRQIAPGQTGKI Spike [Spike NLTTRTQLPPAYTNSFTR [Spike [Spike EIRASANLAATKMSECVL TFS spike |LPIWPPPRCLSVCWARAR 
Spike [51 |DIAPGQTGKIADYNYKL Spike Spike PPAYTNSFTRGVYYPDKV [Spike Spike |AATKMSECVLGQSKRVDF |FTES spike [ELSVCWARAREWTFAAR 
Spike |51 |GKIADYNYKLPDDFTGCV [Spike [Spike|[TRGVYYPDKVFRSSVL [Spike pike LGQSKRVDFCGKGYHLM [FIFS spike WARAREWTFAARATT 
Spike |5! KLPDDFTGCVIAWNSNNL [pike [Spike KVFRSSVLHSTQDLFLPF [Spike pike IKRVDFCGKGYHLMSF [IFS spike [ASLSLPLTAWCFCT 
Spike |51 |CVIAWNSNNLDSKVGGNY [pike |Spike|HSTQDLFLPFFSNVTWF pike pike |LMSFPQSAPHGVVFLHV [IFS spike |AMCPLKRRISPPLQPSAT 
Spike _[S1_INLDSKVGGNYNYLYRLFR_[pike [Spike LPFFSNVTWFHAIHV Spike pike PHGVVFLHVTYVPAQEK TFS spike |ISPPLOPSATTAKPTFL 
Spike |5i NYNYLYRLFRKSNLKPF bike [Spike |FHAIHVSGTNGTKRF. Spike pike TYVPAQEKNFTTAPAT FIFS spike |LQPSATTAKPTFLEK 
Spike |51 FRKSNLKPFERDISTE|  [pike [Spike |TNGTKRFDNPVLPFNDGV [Spike pike IKNFTTAPAICHDGKAHF [IFS spike |PTFLEKACSCPTAPI 
Spike [S1_[PFERDISTEIVQAGSTPC |Spike [Spike NPVLPFNDGVYFASTEK [Spike pike CHDGKAHFPREGVFV [FIFS spike HSGTSTSPRSSPPTTPSC 
Spike [S1_[EIVQAGSTPCNGVEGF [bike |Spike|GVYFASTEKSNIIRGWIF — [Spike pike |FPREGVEVSNGTHWEV [IFS spike [RSSPPTTPSCLATAT 
Spike [S1_STPCNGVEGFNCYFPL pike |Spike|KSNIIRGWIFGTTLDSK [Spike pike [SNGTHWFVTQRNFYEPQI [HFS spike [TIPCTTLCSPSWTASKR 
Spike |5! VEGFNCYFPLOSYGF Epike Bpike|FGTTLDSKTOSLLIV Spike [Spike |TQRNFYEPQIITTDNTFV FIFS spike |SPSWTASKRNWTSTLR 
Spike |51 |CYFPLOSYGFQPTNGVGY [Spike [Spike KTOSLLIVNNATNVVIKV — [Spike [Spike GIITTDNTFVSGNCDVVI ETFS spike |ENWTSTLRTTQAP TWTW 
Spike |51 |GFQPTNGVGYQPYRVVVL Spike [Spike NNATNVVIKVCEFQF Spike [Spike FVSGNCDVVIGIVNNTVY FIFS spike TTOAPTWTWAISAESM 
Spike |51 |GYQPYRVVVLSFELL Spike [Spike KVCEFQFCNDPFLGVYY [bike [Spike IGIVNNTVYDPLOPEL FIFS spike APTWTWAISAESMPA 
Spike |5! RVVVLSFELLHAPATV [Spike [Spike |NDPFLGVYYHKNNKSWM [Spike [Spike DPLOPELDSFKEELDK FIFS spike |STCKNWGSTSSTSSGPGT 
Spike |51 |FELLHAPATVCGPKK Spike [Spike YHKNNKSWMESEFRVY pike [Spike LDSFKEELDKYEKNATSP TFS spike [TSSTSSGPGTSGWAL 
Spike |5! APATVCGPKKSTNLVKNK [Spike [Spike MESEFRVYSSANNCTFEY [Spike [Spike DKYFKNHTSPDVDLGDI |FTFS spike [STSSGPGTSGWALSP 
Spike |5! KKSTNLVKNKCVNFNF pike [Spike SSANNCTFEYVSQPFLM [Spike pike ISPDVDLGDISGINASVV IFS spike WSQSCCVA 
Spike [S1_|VKNKCVNFNFNGLTGTGV [pike [Spike EYVSQPFLMDLEGKQGNF [pike pike ISGINASVVNIQKEIDRL |FTES spike |RAVVAVAAAASSTRTIL 
pike |5! |NFNGLTGTGVLTESNKKF [Spike [Spike MDLEGKQGNFKNLREFVF Spike [Spike 'NIQKEIDRLNEVAKNL [FIFS spike VAAAASSTRTILSPC. 
Spike |5! |GVLTESNKKFLPFQQFGR [pike [Spike NFANLREFVFKNIDGYFK Spike [Spike IRLNEVAKNLNESLIDL |FTES spike SSWYCMHAMLAAPFPSWV 
Spike [51 KFLPFQQFGRDIADTTDA Spike [Spike VFKNIDGYFKIYSKHTP! pike Spike | NESLIDLOELGKYEQYI [FIFS spike MLAAPFPSWVPRVSPDL 
Spike |51 |GRDIADTTDAVRDPQTL [bike [Spike FKIYSKHTPINLVRDL Spike [Spike IQELGKYEQYIKWPWYIWL. [IFS spike |WVPRVSPDLGSQVCSHL 
Spike |S1_|TDAVRDPQTLEILDI Spike [Spike PINLVRDLPOGFSALEPL__ [Spike [Spike IKWPWYIWLGFIAGLI FIFS spike |LGSQVCSHLHLPHSPPLL 
Spike |5! [DPQTLEILDITPCSFGGV [Spike [Spike/PQGFSALEPLVDLPIGI — [Spike [Spike LGFIAGLIAIVMVTIML FIFS spike LHLPHSPPLLVPDTSQAR 
Spike |51 |DITPCSFGGVSVITPGTN Spike |Spike|PLVDLPIGINITRFQTLL — [Spike [Spike IGLIAIVMVTIMLCCM FIFS spike LLVPDTSQARSNAAQNA 
Spike [51 |SVSVITPGTNTSNQVAVL |Spike [Spike INITRFQTLLALHRSYL pike [Spke ^ |TIMLCCMTSCCSCLK FIFS spike PLAINESLTKLY 
Spike [51 [TNTSNQVAVLYQDVNCTE [pike [Spike LLALHRSYLTPGDSSSGW [Spike pike ICSCLKGCCSCGSCCKF [IFS spike |PAGWSISCQPHPGA 
Spike ($1 VLYQDVNCTEVPVAI Spike Spike LTPGDSSSGWTAGAAAYY [Spike pike ISCGSCCKFDEDDSEPVLK 
pike |51 VNCTEVPVAIHADQL Spike [Spike GWTAGAAAYYVGYLQPR pike pike IDEDDSEPVLKGVKLHY 
pike |5! |VPVAIHADQLTPTWRVY Spike [Spike YYVGYLOPRTFLLKY 1FS spike |t1FS spike YSSGPHRLRENPP 
Spike |5! |DQLTPTWRVYSTGSNVF Spike [Spike RTFLLKYNENGTITDAV 1FS spike |HTFS spike [CSCSWCCCLWCPASV, 
Bpike |5! RVYSTGSNVFQTRAGCLI [pike [Spike ENGTITDAVDCALDPL TFS spike |F1FS spike |PPEHSCLQPTPTALPEAC 
Spike |5! VFQTRAGCLIGAEHV Spike [Spike VDCALDPLSETKCTLKSF_|F1FS spike |F1FS spike PTPTALPEACTTPTR 
Spike |S1_[AGCLIGAEHVNNSYECDI [Spike [Spike SETKCTLKSFTVEKGIY 1FS spike |-TFS spike ACTTPTRCSDPACCTLPR 
Spike |51 |HVNNSYECDIPIGAGI Spike [Spike |SFTVEKGIYQTSNFRV TFS spike |F1FS spike SDPACCTLPRTCSCL 
Spike |S1_|ECDIPIGAGICASYQTOT [Spike |Spike|YOTSNFRVQPTESIVRF  |*iFS spike |-1FS spike |PRTCSCLSSATPGSTPST 
Spike |52 |GICASYQTQTNSPRRAR Spike [Spike QPTESIVRFPNITNLCPF  |*1FS spike |F1FS spike [SATPGSTPSTCPAPMAPR 
pike |52 |TQTNSPRRARSVASQSI!! [ike |Spike|FPNITNLCPFGEVFNATR _ [+1FS spike |F1FS spike [STCPAPMAPRDSTTPCCP 
Epike |52 |ARSVASQSIIAYTMSL Spike [Spike PFGEVFNATRFASVYAW _ |F1FS spike |t1FS spike [PRDSTTPCCPSTTGCTL 
pike [S2_|QSIIAYTMSLGAENSVAY Spike [Spike [TRFASVYAWNRKRISNCV_[¥1FS spike |-1FS spike PCCPSTTGCTLPAPR 
Spike |52 |SLGAENSVAYSNNSIA| pike [Spike WNRKRISNCVADYSVLY [FIFS spike |F1FS spike | PAPRSPTSSEAGSSAP 
Spike |52 |VAYSNNSIAIPTNFTISV — [ike [Spike CVADYSVLYNSASFSTFK |t1FS spike |-1FS spike [SSEAGSSAPHWTARPR 
Spike |52 |AIPTNFTISVTTEILPV Spike [Spike YNSASFSTFKCYGVSPTK_|¥1FS spike |t1FS spike JAGSSAPHWTARPRAC 
Spike |52 |SVTTEILPVSMTKTSV [bike [Spike FKCYGVSPTKLNDLCF TFS spike FIFS spike TTPPTWSSKCASSSSATT 
Spike |52 LPVSMTKTSVDCTMY| [bike |Spike|TKLNDLCFTNVYADSFVI |*iFS spike |F1FS spike |SSKCASSSSATTPSW 
Spike |52 KTSVDCTMYICGDSTECS [Spike [Spike [TNVYADSFVIRGDEVRQI_ [F1FS spike |-1FS spike TTPSWASTTTRTTRAGWK 
Spike [S2_|YICGDSTECSNLLLQY Spike [Spike VIRGDEVRQIAPGQTGKI__|F1FS spike |F1FS spike TTRTTRAGWKASSGCTAA 
pike |52 |rECSNLLLQYGSFCTQL Spike [Spike QIAPGQTGKIADYNYKL __|¥1FS spike |F1FS spike WKASSGCTAAPTTAPSST 
Spike [S2_[LQYGSFCTQLNRALTGI Spike [Spike KIADYNYKLPDDFTGCVI  |K1FS spike |F1FS spike [TAAPTTAPSSTCPSL 
Spike [S2_|TQLNRALTGIAVEQDK pike [Spike |[PDDFTGCVIAWNSNNL [FIFS spike |F1FS spike WTWKASRATSRTCASSCL 
pike |52 |TGIAVEQDKNTQEVF pike [Spike VIAWNSNNLDSKVGGNY _ |+1FS spike |F1FS spike [TSRTCASSCLRTSTATSR 
Spike |52 [EQDKNTQEVFAQVKQIYK [pike [Spike LDSKVGGNYNYLYRLERK [FS spike |F1FS spike |CLRTSTATSRSTASTPL 
Spike |52 |VFAQVKQIYKTPPIKDF — [pike [Spike YNYLYRLFRKSNLKPFER [FIFS spike |-1FS spike SRSTASTPLSTSCGICLR 
Spike |52 |YKTPPIKDFGGFNFSQ! [bike |Spike /RKSNLKPFERDISTEIY 1FS spike |F1FS spike |LSTSCGICLRASLLWNPW 
Spike |s2 |DFGGFNFSQILPDPSK Spike [Spike ERDISTEIYQAGSTPC 1FS spike |-TFS spike (GICLRASLLWNPWWI 
Spike |52 |FSQILPDPSKPSKRSF! [bike Spike YGAGSTPCNGVEGFNCYF [FIFS spike |-1FS spike |PWWICPSASTSPGFRHCW] 
Spike |52 |PSKPSKRSFIEDLLFNKV [Spike [Spike NGVEGFNCYFPLOSYGF |KiFS spike |F1FS spike [CPSASTSPGFRHCW 
Spike |52 |FIEDLLFNKVTLADAGF! [Spike |Spike|YFPLOSYGFQPTNGVGY |t1FS spike |-1FS spike HLAIAAADGQLVPPLTMW 
Spike |522 KVTLADAGFIKQYGDCL __ pike |SpikelFQaPTNGVGYOPYRVVVL |t1FS spike |F1FS spike [GQLVPPLTMWATCSL 
pike |52 |GFIKQYGDCLGDIAARDL Spike [Spike |GYOQPYRVVVLSFELL 1FS spike |-1FS spike PPLTMWATCSLEPSC 
Spike |52 |CLGDIAARDLICAQKF Spike [Spike | SFELLHAPATVCGPKK __}¥1FS spike |+1FS spike |STTRTAPSPTPWIVLWIL 
Spike |52 |ARDLICAQKFNGLTVL Spike [Spike PATVCGPKKSTNLVKNK __|r1FS spike |F1FS spike [SPSPWKRASTRPATSGC 
Spike |52 |AQKFNGLTVLPPLLTDEM pike [Spike KSTNLVKNKCVNFNFNGL |t1FS spike |F1FS spike [STRPATSGCSPPNPSC 
Spike |52 |VLPPLLTDEMIAQYTSAL [spike [Spike KCVNFNFNGLTGTGVL 1FS spike |-TFS spike |CSPPNPSCGSPISPI 
Spike |52 |EMIAQYTSALLAGTI Spike [Spike |GLTGTGVLTESNKKFLPF__}r1FS spike |+1FS spike |GSPISPICAPSARCSM 

TSALLAGTITSGWTF [Spike [TESNKKFLPFQQFGRDI [FIFS spike [CAPSARCSMPPDSPL 

[AGTITSGWTFGAGAALQI [Spike PFQQFGRDIADTTDAVR |F1FS spike |PPDSPLCTPGTGSGSAI 

FGAGAALQIPFAMQMAY [Spike JADTTDAVRDPQTLEIL [FIFS spike |LCTPPGTGSGSAIAW 


List of peptides in each pool used for Spike and +1FS Spike ELISpot analysis. 
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Reporting Summary 


Nature Portfolio wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Portfolio policies, see our Editorial Policies and the Editorial Policy Checklist. 


Statistics 
For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


/a | Confirmed 


3 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 
A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 
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Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above 


Software and code 


Policy information about availability of computer code 


Data collection AID EliSpot Software (version 7.0), Proteome Discoverer (version 2.5). 


Data analysis R (version 4.3.0), ggplot2 (version 3.4.2), DescTools (version 0.99.46), STAR (version 2.7.4a). Scripts for analysis of RNA-seq data are available 
at: https://github.com/tom-mulroney/rna-seq_mutations. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors and 
reviewers. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Portfolio guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A description of any restrictions on data availability 


- For clinical datasets or third party data, please ensure that the statement adheres to our policy 


Mass spectrometry data have been deposited with MassIVE ID MSVOOOO93074. RNA-seq reads and processed files are available at NCBI Gene Expression Omnibus 
(Accession GSE223044). Additional data are available from Figshare (DOI: 10.6084/m9.figshare.24271744). The following accessions were used for mass 
spectrometry analysis: UPOOOOO1811 and PO8659 (UniProt). 
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Human research participants 


Policy information about studies involving human research participants and Sex and Gender in Research. 


Reporting on sex and gender Sex and gender were not considered during study design. 37% of study participants were male. 63% were female. These data 
are included in Supplementary Table 1. 


Population characteristics Age, sex. 


Recruitment Participants were initially recruited as part of observational studies of the COVID-19 vaccine responses (https:// 
doi.org/10.1016/j.cell.2021.10.011, https://doi.org/10.1038/s41591-023-02343-2, https://doi.org/10.1038/ 
s41467-023-38810-0). Excess available samples from healthy vaccinated participants were subject to analysis. There are no 
recruitment biases identified that are likely to impact results. 


Ethics oversight Human sample collection and analysis was conducted in accordance with the principles of Good Clinical Practice and 
following approved protocols of the NIHR National Bioresource. Samples were collected with the written informed consent of 
all study participants under the NIHR National BioResource-Research Tissue Bank (NBR-RTB) ethics (REC:17/EE/0025) and 
from the PITCH study. PITCH is a sub-study of the SIREN study, which was approved by the Berkshire Research Ethics 
Committee, Health Research 250 Authority (IRAS ID 284460, REC reference 20/SC/0230), with PITCH recognised as a sub- 
study on 2nd December 2020. SIREN is registered with ISRCTN (Trial ID:252 ISRCTN11041050). Some participants were 
recruited under aligned study protocols. In Liverpool, some participants were recruited under the “Human immune 
responses to acute virus infections” Study (16/NW/0170), approved by North West - Liverpool Central Research Ethics 
Committee on 8th March 2016, and amended on 14th September 2020 and 4th May 2021. In Oxford, participants were 
recruited under the GI Biobank Study 16/YH/0247, approved by the research ethics committee (REC) at Yorkshire & The 
Humber - Sheffield Research Ethics Committee on 29th July 2016, which has been amended for this purpose on 8th June 
2020. The study was conducted in compliance with all relevant ethical regulations for work with human participants, and 
according to the principles of the Declaration of Helsinki (2008) and the International Conference on Harmonization (ICH) 
Good Clinical Practice (GCP) guidelines. Written informed consent to publish clinical and genetic data, in addition to study 
participation was obtained for all participants enrolled in the study. 
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Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


Dx] Life sciences C] Behavioural & social sciences C] Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample sizes for mouse experiments were determined based on previous analysis. Because antigen-specific CD8+ T cell numbers are likely to 
be a continuous trait, the widely-accepted formula for calculating sample sizes for continuous variable published by Snedecor and Cochran 
989) has been used (https://doi.org/10.1017/S0021859600074104), with the following variable (s = standard deviation (~10% of mean - per 
previous mouse experiments in the lab), d = desired effect size to detect (+ 15% of mean), C = constant dependent on the value of a and B (in 
his case 7.85; see https://doi.org/10.1093/ilar.43.4.207, a = 0.05, B = 0.2 (so that power = 8096 i.e. 1- B)). So: n = 1 + ((2 x 7.85) x (0.1/0.15)2) 
= 7.97 (i.e. 8 in each sample group). Human sample sizes were limited due to available excess PBMC samples from recruited study participants 
of studies of the COVID-19 vaccine response (https://doi.org/10.1016/j.cell.2021.10.011, https://doi.org/10.1038/s41591-023-02343-2, 
https://doi.org/10.1038/s41467-023-38810-0). A power analysis calculation for available samples was performed using MESS R package 
version 0.5.9) prior to analysis based on effect sizes from previous experiments for n=20, n=21 (2-groups, one-tail, unequal variance, p=0.05), 
yielding power>0.8. Sample sizes for in vitro experiments were determined by previous similar experiments (e.g. https://doi.org/10.1093/nar/ 
gka347). 


Data exclusions No data were excluded from analysis. 
Replication All attempts at replication are included in the figures. 
Randomization Randomisation was not applicable at this was a non-interventional study. 


Blinding Blinding was not applicable at this was a non-interventional study. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


= 

D 

E 

(e 

Materials & experimental systems Methods 2 
n/a | Involved in the study n/a | Involved in the study e 
e 

[] Antibodies L] ChIP-seq o 
[] Eukaryotic cell lines L] Flow cytometry bee 
[] Palaeontology and archaeology L] MRI-based neuroimaging D 

18) 

[1] Animals and other organisms o 
ct 

[.] Clinical data 5 

Ke; 

[1] Dual use research of concern 2 

3 

3 

[y 

< 


Antibodies 
Antibodies used anti-FLAG M2 (Primary antibody raised in mouse)(Sigma Aldrich F1804), anti-mouse-HRP (secondary antibody raised in goat)(Dako 
P0447), anti-FLAG magnetic agarose (primary antibody conjugate raised in rat, Thermo Scientific A36797), IFNy detector antibody 
clone 7-B6-1 (included in MabTech 3420-4APT kit). 
Validation anti-FLAG M2 antibody (Sigma Aldrich F1804) has been extensively validated by previous studies and the manufacturer for western 


blotting (e.g. https://doi.org/10.1186/s12985-016-0610-7, https://doi.org/10.1016/j.bbamcr.2019.06.002.). anti-FLAG magnetic 
agarose (Thermo Scientific A36797) has been validated for immunoprecipitation (e.g. https://doi.org/10.1002/cpz1.156). IFNy 
detector antibody clone 7-B6-1 has been validated for ELISpot (e.g. https://doi.org/10.1016/j.cell.2020.08.017.). 


Eukaryotic cell lines 


Policy information about cell lines and Sex and Gender in Research 


Cell line source(s) HeLa cells (sex female) were obtained from ATCC. 
Authentication HeLa cells were independently authenticated by STR typing. 
Mycoplasma contamination HeLa cells were tested for mycoplasma infection and tested negative for mycoplasma infection. 


Commonly misidentified lines HeLa cells are not a Misidentified Cell Line according to ICLAC register Version 12 (2023). 
(See ICLAC register) 


Animals and other research organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research, and Sex and Gender in 
Research 


Laboratory animals C57BL/6J mice (wild type, WT) were purchased from Charles River laboratories. Mice were used at 8-12 weeks age. 
Mice were housed at the University of Cambridge as specific pathogen-free/SPF, 19-23 degrees Celsius, and the humidity is kept 
45%-65%, a 12 hour (7am- 7pm) light dark cycle. 

Wild animals No wild animals were used in the study. 

Reporting on sex Sex was not considered in the study design. All mice were female. 


Field-collected samples | No field collected samples were used in the study. 


Ethics oversight Animal experiments were licensed by the UK Home Office according to the Animals Scientific Procedures Act 1986 (License 
PP6047951), approved and conducted in compliance with protocols by the University of Cambridge, University Biomedical Services 
Animal Welfare and Ethical Review Bodies (AWERB) committee. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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